Method for producing RNA viruses from cDNA

ABSTRACT

The present invention relates to methods for producing RNA virus cDNA, methods for producing viable, RNA virus and viable, RNA virus produced by those methods. The invention also related to a novel RNA virus cDNA, recombinant DNA molecules containing that cDNA and hosts transformed with those recombinant cDNA molecules. This invention further related to novel methods for screening for variants of a strain 3 poliovirus. This invention also related to methods for increasing the attenuation of a strain poliovirus. This invention provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus: culturing the host cell under conditions which permit the production of virus: and isolating the virus so produced, effective to immunize the subject, and a suitable carrier. Further provided to this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering the subject a suitable dose of the vaccine described hereinabove.

This application is the U.S. national stage application of the international application, PCT/US91/05890, filed Aug. 20, 1991, which is a continuation-in-part of U.S. Ser. No. 07/569,916, filed Aug. 20, 1990, now abandoned, and a continuation-in-part of U.S. Ser. No. 07/570,000, filed August 20, 1990, now abandoned.

BACKGROUND OF THE INVENTION

Throughout this application various references are referred to within parentheses or with arabic numerals within parenthesis. Full bibliographic citations for these publications referred to by arabic numerals may be found at the end of the specification immediately preceding the claims. The disclosures of these publications in their entireties are hereby incorporated by reference into this application to more fully describe the state of the art to which this invention pertains.

Human enteroviruses belonging to the family Picornaviridae are characterized by a single-stranded positive RNA genome. Members of this viral family include poliovirus, echoviruses, coxsackieviruses and rhinoviruses. Among these viruses, poliovirus has been the most extensively studied.

Poliovirus is known to be the causative agent of poliomyelitis, a paralytic disease of the central nervous system. This virus is known to exist in three stable serotypes--1, 2 and 3. For over 25 years, this disease has been controlled by the use of both the Sabin oral live-attenuated vaccine and the Salk inactivated virus vaccine. The Sabin vaccine consists of attenuated virus of each serotype, none of which are capable of causing disease. The strains used to produce the vaccine were created by a combination of extensive in vivo and in vitro passage of each of the three wild-type strains through monkey tissue. Upon oral administration, the live virus contained in the Sabin vaccine replicates in the gut, thereby inducing both systemic and local immunity. The killed virus (Salk) vaccine, which is administered intramuscularly, is limited to inducing systemic immunity.

Although the Sabin vaccine is considered to be a safe and effective protection against poliomyelitis, a small number of recipients have developed vaccine-associated the disease.

In an effort to understand the molecular basis of attenuation and reversion, the nucleotide sequences of cDNAs corresponding to each of the 3 attenuated strains and their wild-type progenitors, were compared [A. Nomoto et al., "Complete Nucleotide Sequence of the Attenuated Sabin 1 Strain Genome", Proc. Natl. Acad. Sci. USA, 79, pp 5793-97 (1982); G. Stanway et al., "Nucleic Acid Sequence of the Region of the Genome Encoding Capsid Protein VP1 of Neurovirulent and Attenuated Type 3 Polioviruses", Eur. J. Biochem., 135, pp. 529-33 (1983); G. Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12ab", Proc. Natl. Acad. Sci. USA, 79, pp. 1539-43 (1984); and H. Toyoda et al., "Complete Nucleotide Sequences of All Three Poliovirus Serotype Genomes", J. Mol. Biol., 174 pp 561-585 (1984)]. The observed differences in nucleotide sequence between each wild-type progenitor and its resultant attenuated strain were then further analyzed to determine their relationship to the phenomenon of attenuation.

In serotype 3, for example, the attenuated strain differed from the wild-type strain by only 10 point mutations [G. Stanway et al., (1984), supra]. Of these differences, only the changes at nucleotide positions 472 and 2034 were thought to be strongly associated with attenuation [D. M. A. Evans et al., "Increased Neurovirulence Associated With A Single Nucleotide Change In A Noncoding Region of the Sabin Type 3 Poliovirus Genome", Nature, 314, pp. 548-50 (1985); G. D. Westrop et al., "Genetic Basis of Attenuation of the Sabin Type 3 Oral Poliovirus Vaccine", J. Virol., 63, pp. 1338-44 (1989)].

Prior to the identification of the nucleotides which are linked to attenuation, it was demonstrated that cDNA synthesized from a viral RNA template ("RNA virus cDNA") could be utilized to produce viable poliovirus following transfection of mammalian cells [V. R. Racaniello et al., "Cloned Poliovirus Complementary DNA Is Infectious In Mammalian Cells", Science, 214, pp. 916-19 (1981)]. Such observations created the possibility of producing improved polio vaccines via genetic engineering techniques. This could be achieved by altering the cDNA around the crucial nucleotides so as to minimize reversion to the wild-type nucleotide, while maintaining structural and functional integrity of the virus.

Despite the discovery that RNA virus cDNA can be used to produce viable virus, it has never been demonstrated that these cDNA are accurate copies of the viral RNA present in wild-type or vaccine virus. Moreover, the use of cDNA sequences to determine which nucleotides are linked to attenuation, may have caused one or more critical sites to have been overlooked. This is because the process used to produce cDNA, namely reverse transcription, is known to be errorprone [I. M. Verma, "Reverse Transcriptase", In The Enzyme, Vol. 14, P. D, Boyer, ed., Academic Press, New York, pp. 87-104 (1981)].

Accordingly, a need still exists for the production of RNA virus cDNAs which are truly complementary to the vaccine virus RNA. Moreover, the use of inaccurate RNA virus cDNAs may result in reduced attenuation, if these cDNAs are ultimately to be used to produce vaccines, such as polio vaccines.

The genome of poliovirus is a single-stranded RNA molecule of plus-sense that is approximately 7500 nucleotides in length. The error frequency associated with replication of single-stranded RNA, as for poliovirus, is especially high compared to that of double-stranded DNA (3). Due to this inherent property, every preparation of poliovirus including the original Sabin (SO) strains must be considered genotypically heterogeneous.

Culture conditions (ie. temperature, cell substrate) as well as the homogeneity of the input virus are likely to influence which genotype predominates during amplication of a poliovirus sample. It is therefore not surprising that authorities who regulate the manufacture of OPVs (ie. FDA and WHO) dictate strict guidelines regarding the production of manufacturing seeds as well as the passage level of the seed represented in vaccine (22, 26). These regulations were put into action as an effort to minimize selection and amplification of less attenuated variant strains.

It has been well documented that the attenuated phenotype of the Sabin 3 strain is less genetically stable than the type 1 and 2 vaccine strains (4, 7, 11). In the past, a new manufacturing seed (RSO) was derived from the original Sabin 3 virus by selecting a plaque produced in Vervet monkey kidney cell monolayers from extracted infectious RNA (19). The isolate was chosen based on increased stability of its sensitivity to grow at 40.3° C. (rct marker) during serial passage as well as increased attenuation in monkeys. The sensitivity of growth at temperatures above 37° (rct marker) is still employed as an in vitro biological test to analyze the quality of vaccine strains (13).

A report by Kohara et al. (9) suggested that an infectious cDNA clone might be used to preserve the constancy and quality of the Sabin 1 seed. It is plausible that a similar approach could also benefit the attenuated type 3 strain. Until recently, the literature contained two cDNA sequences for Sabin 3 which differed at nucleotide positions (17, 21). The divergence between these sequences may be due to the fact that passage derivatives and clonal isolates of Sabin 3 rather than actual vaccine virus were used for making the cDNA clones.

SUMMARY OF THE INVENTION

This invention also provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; and isolating the virus so produced, effective to immunize the subject.

Further provided by this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering to the subject a suitable dose of the vaccine described hereinabove.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 depicts a chromatographic profile of first-strand cDNA synthesized from poliovirus strain 3 RNA off a Sepharose CL-4B column.

FIG. 2 depicts the cDNA clones which mapped to the P3/Sabin genome.

FIG. 3 depicts the strategy for assembling partial cDNAs into a single, full-length P3/Sabin cDNA according to this invention.

FIGS. 4A and 4B depict the strategy for mutagenizing P3/Sabin cDNA and constructing a true, full-length P3/Sabin cDNA according to this invention. Depiction of the strategy continues in FIG. 5.

FIG. 4C depicts the following. The sequence of pLED3 depiced in FIGS. 6A-6K (SEQ ID NO:1-2) was deduced from the complete analysis of pVR318 which was then altered at position 2493 by exchanging a SacI/HindIII fragment from the same subclone (pLL3-271) used to construct pVR318. However, confirmatory sequencing of the entire cDNA sequence in plasmid pLED3 revealed an additional "A" (adenosine) in a region where a run of six A's is normally found in the vital genome (position 4133-4138). The erroneous "A" was also found in subclone pLL3-271. The pLL3-271 and pBr318 were manipulated to make pLED3.2 which contains the correct number of A's at this site.

In order to reconstruct pLED3 with the correct number of A's, the SacI/HindIII fragment from pLL3-271 used above was cloned into bacteriophage M13 for oligonucleotide-directed deletion mutagenesis. An oligonucleotide spanning nucleotides 4121-4150 was synthesized for this purpose. The oligonucleotide has the sequence: 5'-CGCCTCAGTAAATTTTTTCAACCAACTATC-3' (SEQ ID NO:3)

Mutagenesis was performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. The mutagenized insert was called CB2 and was demonstrated to possess six A's at positions 4133-4138 as well as C at 2493 by sequence analysis. The corrected full-length construct was made by ligating the SacI/HindIII fragment of CB2 to the SacI/HindIII partial digestion fragment of pVR318 which had been used in the original pLED3 construction. The product of this ligation was called pLED3.2. The entire cDNA sequence of pLED3.2 has been verified to match the sequence reported in FIGS. 6A-6K.

FIG. 5 continues depiction of the strategy depicted in FIGS. 4A and 4B.

FIGS. 6A through 6K (SEQ ID NO:1-2) depict the nucleotide sequence of a true type 3 poliovirus vaccine strain cDNA according to this invention.

FIGS. 7A and 7B. FIG. 7A depicts the structure of the plasmid containing full-length LED3 cDNA and the T7 RNA polymerase promoter. The large shaded area represents the poliovirus cDNA. The small open area indicates the T7 promoter (pT7) and the start site for in vitro positive polarity transcripts of the cDNA. The plasmid is cut with PvuI (sites shown) before run-off transcripts are synthesized. FIG. 7B depicts RNAs transcribed from cDNA clones by purified T7 RNA polymerase. Portions of the transcription reaction mixture were analyzed by electrophoresis in a 0.6% agarose gel as described in Materials and Methods, Example 13. Lanes 1-3 represent 0.75 μg, 0.50 μg and 0.25 μg of 7.5 kb ssRNA marker, respectively. HindIII-digested phage lambda DNA is shown in lane 4. Lanes 5 & 6 demonstrate transcription reactions containing PvuI-digested pLED3 and pVR318 templates, respectively. RNA extracted from pelleted virions is shown in lane 7.

FIG. 8, depicts SDS-polyacrylamide gel electrophoresis of LED3 and VR318 cDNA-derived viruses. A [³⁵ S]methionine-labeled sample was prepared and loaded in each lane and resolved by eletrophoresis as described hereafter. The gel was dried and the protein bands visualized by autoradiography. LED3 virus in lanes "A"; VR318 virus in lanes "B". The positions of prestained molecular mass markers (kilodaltons) are indicated on the left. Positions of viral capsid proteins are identified on the right.

FIGS. 9A-9C depicts plaque phenotype of Leon (wild-type), VR318 and LED3 viruses on Vero cells. After incubation at 33.5° C. for 3 days under 1.0% nutrient agar, cells were stained with neutral red to visualize plaques.

FIG. 10, depicts kinetics of virus growth at 33.5° C. Vero cell monolayers were infected at an MOI of 4, and the extracellular medium was harvested at the indicated times post-infection. Plaque assays were used to determine the titer of infectious particles per ml as described in Materials and Methods, Example 13.

    ◯--◯, LED3; .increment.- - - - - -.increment., VR318.

FIG. 11, depicts thermal stability of LED3 and VR318 virus at various temperatures. Virus samples containing approx. 10⁷.3 pfu/ml were incubated at 22° C. (circle), 37° C. (square) and 42° C. (triangle). Samples were periodically removed and titer of infectious virus determined by plaque assays. Open markers, LED3; filled markers VR318.

DETAILED DESCRIPTION OF THE INVENTION

This invention provides a method for producing a true RNA virus cDNA comprising the steps of (a) isolating genomic RNA from an RNA source virus; (b) employing RNA sequencing means to determine the nucleotide sequence of a portion of the isolated genomic RNA; (c) employing cDNA synthesis means to produce a double-stranded cDNA from the isolated genomic RNA; (d) employing DNA sequencing means to determine the nucleotide sequence of a portion of the cDNA, wherein the portion of the cDNA corresponds to the portion of the RNA sequenced in step (b); (e) comparing the sequenced cDNA with the sequenced RNA to determine substantive differences in nucleotide sequence; and (f) altering the substantive differences in the cDNA to produce a true RNA virus cDNA.

This invention also provides a method for producing an RNA virus cDNA comprising the steps of a) isolating genomic RNA from an RNA source virus, b) employing RNA seqencing means to determine the nucleotide sequence of a portion of said isolated genomic RNA, c) employing cDNA synthesis means to produce a double-stranded cDNA from said isolated genomic RNA, d) employing DNA sequencing means to determine the nucleotide sequence of a portion of said cDNA, wherein said portion of said cDNA corresponds to said portion of said RNA sequenced in step b), e) comparing said sequenced cDNA with said sequenced RNA to determine substantive differences in nucleotide sequence, and altering said substantive differences in said cDNA to produce an RNA virus cDNA.

This invention also provides the methods described hereinabove, wherein the RNA source virus is a Picornavirus, such as a vaccine strain 3 poliovirus.

In one embodiment of the invention, the portion of RNA sequenced in step (b) comprises nucleotide 2493 of a vaccine strain 3 poliovirus. In a further aspect of this embodiment, the RNA consists of about 100 to 200 nucleotides. Alternatively, the nucleotide sequence of the entire isolated viral RNA is determined in step (b).

This invention further provides a true RNA virus cDNA or an RNA virus cDNA produced by the methods described hereinabove. For example, a true RNA virus cDNA or an RNA virus cDNA may be produced which is derived from a vaccine strain 3 poliovirus, the cDNA being selected from that contained in a novel plasmid designated pLED3.2 or pLED3, respectively or cDNAs which code on expression for the polypeptides coded on for expression by pLED3.2 or pLED3, respectively and the recombinant DNA molecule produced thereby.

The recombinant DNA molecule can be operatively linked to a promoter of RNA transcription. Suitable promoters include, but are not limited to the T7 promoter.

A host transformed with a recombinant DNA molecule described hereinabove is also provided by this invention, wherein the host is selected from the group consisting of bacteria, such as E. coli, yeast and other fungi, insect cells and animal cells. Suitable animal cells include, but are not limited to Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells.

Further provided by this invention is a method for producing a viable RNA virus comprising the steps of: (a) culturing a host described hereinabove under conditions which permit the production of viable RNA virus; and (b) harvesting the viable RNA virus from the host cell culture.

This invention provides a method of producing a viable RNA virus comprising the steps of: (a) employing in vitro transcription means to produce RNA from a recombinant DNA molecule described hereinabove (b) isolating the RNA; (c) transfecting a host with the isolated RNA, wherein the host is an animal cell; (d) culturing the host under conditions which permit the production of viable RNA virus; and (e) harvesting the viable RNA virus from the host cell culture. For example, vaccine strain 3 poliovirus may be used and the host may be a primary monkey kidney cell.

This invention further provides a method of producing a viable RNA virus comprising the steps of: (a) transfecting a host with RNA virus cDNA, wherein the cDNA is selected from that contained in the plasmid pLED3 or pLED3.2 or cDNAs which code on expression for the polypeptides coded on for expression by pLED3 or pLED3.2, wherein said host is an animal cell; (b) culturing said host under conditions which permit the production of viable RNA virus from said host cell culture.

This invention further provides an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced.

In one embodiment of this invention, the RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced.

In one embodiment of this invention, the RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the vaccine strain 3 poliovirus.

A method of screening for variants of a strain 3 poliovirus is also provided which comprises the steps of: (a) isolating genomic RNA from the poliovirus; (b) employing RNA sequencing means to determine the nucleotide at position 2493.

This invention also provides a method for increasing the attenuation of a strain 3 poliovirus encoded by an RNA virus cDNA, wherein the cDNA comprises the nucleotide sequence ATT at positions 2492 to 2492, the method comprising the step of mutagenizing the cDNA at nucleotide 2493 to change the T to C. In one embodiment of the invention, the method can further comprise the step of mutagenizing the cDNA at nucleotide 2494 to change the T to a nucleotide selected from the group consisting of A, C and G.

This invention provides a vaccine useful for immunizing a subject, for example a human, against infectious poliovirus, wherein the vaccine comprises an effective amount of an RNA virus, produced by transforming a suitable host cell with a recombinant nucleic acid sequence which encodes for the virus; culturing the host cell under conditions which permit the production of virus; and isolating the virus so produced, effective to immunize the subject, and a suitable carrier. The RNA virus may be a vaccine strain 3 poliovirus and the recombinant nucleic acid sequence is a recombinant infectious full length nucleic acid sequence which encodes for the vaccine strain 3 poliovirus.

The recombinant nucleic acid sequence which encodes for the RNA virus may be a DNA sequence, or an RNA sequence or in the preferred embodiment of this invention, the recombinant nucleic acid sequence is a cDNA sequence. Suitable cDNAs are the plasmids designated pLED3.2 or pLED3, each of which contains a promoter linked cDNA nucleic acid sequence which encodes for the vaccine strain 3 poliovirus. The plasmid pLED3 was deposited with the American Type Culture Collection (ATCC), located at 12301 Parklawn Drive, Rockville Md., 20852, under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedure. The plasmid pLED3 was assigned Accession No. 40789. The plasmid pLED3.2 also was deposited with the ATCC under the provisions of the Budapest Treaty and was assigned Accession No.

Suitable host cells include, but are not limited to bacteria, such as E. coli, yeast and other fungi, insect cells and animal cells.

Suitable animal cells include, but are not limited to Vero cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells.

The suitable carrier may be a physiologically balanced culture medium, such as aline containing stabilizing agents, for example, dextrose and lactose, or other nontoxic substances. These vaccines may also be formulated with a suitable adjuvant such as alum. For methods of vaccine preparation, see J. I. Duffy, Vaccine Preparation Techniques, Noyes Data Corporation (1980), and G. W. Warr, "Preparation of Antigens and Principles of Immunization", in J. J. Marchalonis and G. W. War. eds., Antibody As A Tool--The Applications of Immunochemistry, pp. 21-58, John Wiley & Sons (1982).

The RNA virus may also be desiccated, e.g., by freeze drying for storage or for subsequent formulation into liquid vaccines.

Further provided by this invention is a method of immunizing a subject such as a human against infectious poliovirus, wherein the method comprises administering to the subject a suitable dose of the vaccine described hereinabove. Suitable methods of administering vaccines are well known to those of ordinary skill in the art. However, by way of example, such methods may include but are not limited to intramuscular, intravenous, subcutaneous, intratracheal or intranasal administration.

Additionally, the effective immunizing amount is an amount which is necessary to invoke the production of antibodies by the subject thereby conferring protection on the subject against infectious poliovirus or poliomyelitis.

Throughout this application, references to specific nucleotides in cDNA molecules are to nucleotides present on the coding strand of the cDNA, i.e., the strand which has a sequence equivalent to the position RNA strand of an RNA virus. References to specific nucleotide position numbers in strain 3 poliovirus follow the nucleotide numbering system of G. Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12alb", Proc. Natl. Acad. Sci. U.S.A., 81, pp 1539-43 (1984). The following standard abbreviations are used throughout the specification and in the claims to indicate specific nucleotides:

    ______________________________________                                         C - cytosine        A - adenosine                                              T - thymidine       G - guanosine                                              U - uracil                                                                     ______________________________________                                    

The term "source virus" refers to the RNA virus from which RNA is isolated and used as a template for cDNA production. And the term "increased attenuation" is used throughout to indicate a lower rate of reversion to the neurovirulent phenotype than those of conventional vaccine virus strains.

The term "true RNA virus cDNA", as used herein, refers to a cDNA which directs the production of a viable RNA virus that is phenotypically similar to the source virus. Accordingly, the present invention encompasses cDNA molecules which, by virtue of the redundancy of the genetic code, are characterized by a nucleotide sequence that differs from that of the source virus RNA, but which encode polypeptides having the same amino acid sequences as those encoded by the source virus RNA. The invention also encompasses cDNAs which encode amino acid sequences which differ from those of the source virus polypeptides, but which do not produce phenotypic changes. Hereinafter, these altered, but phenotypically equivalent amino acid sequences are referred to as "equivalent amino acid sequences." And this invention encompasses cDNA molecules characterized by changes in noncoding regions that do not alter the phenotype of the RNA virus produced therefrom when compared to the source virus. Differences between the nucleotide sequence of an RNA virus cDNA and the source virus RNA which result in phenotypical differences in the virus produced therefrom are hereinafter referred to as "substantive differences".

This invention provides a method for producing an RNA virus cDNA comprising the steps of (a) isolating genomic RNA from an RNA source virus, (b) employing RNA sequencing means to determine the nucleotide sequence of a portion of the isolated genomic RNA, (c) employing cDNA synthesis means to produce a double-stranded cDNA from the isolated genomic RNA, (d) employing DNA sequencing means to determine the nucleotide sequence of a portion of the cDNA, wherein the portion of the cDNA corresponds to the portion of the RNA sequenced in step (b), (e) comparing the sequenced cDNA with the sequenced RNA to determine substantive differences in nucleotide sequence, and (f) altering the substantive differences in the cDNA to produce an RNA virus cDNA. In one embodiment of this invention, the RNA virus is a true RNA virus and in another embodiment of this invention the RNA virus is a "phenotypically equivalent" virus. In one embodiment of this invention, the RNA source virus is a Picornavirus, for example a vaccine strain 3 poliovirus.

This invention also provides a method as described hereinabove wherein the portion of RNA sequenced in step (b) comprises nucleotide 2493 of a vaccine strain 3 poliovirus, or preferably, wherein the RNA consists of about 100 to 200 nucleotides.

In another embodiment of this invention, the method described hereinabove wherein the nucleotide sequence of the entire isolated viral RNA is determined in step (b).

The method of this invention may also be used to produce an RNA virus cDNA by altering nucleotides in various regions of the poliovirus genome, for example, to correct mutations introduced into the region coding for the amino terminus of VP1 of the type 1 Mahoney poliovirus [K. Kirkegaard, "Mutations in VP1 of Poliovirus Specifically Affect Both Encapsulation and Release of Viral RNA", J. Virology, 64, pp. 195-206 (1990)].

Further provided by this invention is an RNA virus cDNA produced by any of the methods described hereinabove which may include, but is not limited to an RNA virus cDNA derived from a vaccine strain 3 poliovirus, the cDNA being selected from the group consisting of pLED3 or pLED3.2 and cDNAs which code an expression for the polypeptides coded on for expression by pLED3 or pLED3.2.

This invention also provides a recombinant DNA molecule comprising an RNA virus cDNA as described hereinabove, which may include, but is not limited to the recombinant DNA molecule wherein the RNA virus cDNA is operatively linked to a promoter of RNA transcription such as the T7 promoter.

A host transformed with a recombinant DNA molecule described hereinabove is also provided wherein the host is selected from the group consisting of bacteria, yeast and other fungi, insect cells and animal cells. In a preferred embodiment of this invention, the host is an animal cell and is selected from the group consisting of Veto cells, HeLa cells, COS cells, CV-1 cells and primary monkey kidney cells. In another embodiment of this invention, the host is E. coli.

This invention further provides a method for producing a viable RNA virus comprising the steps of: culturing a host described hereinabove under conditions which permit the production of viable RNA virus and harvesting the viable RNA virus from the host cell culture. This invention also provides a method of producing a viable RNA virus comprising the steps of employing in vitro transcription means to produce RNA from a recombinant DNA molecule described hereinabove, isolating the RNA, transfecting a host with the isolated RNA, wherein the host is an animal cell, culturing the host under conditions which permit the production of viable RNA virus and harvesting the viable RNA virus from the host cell culture. In the preferred embodiment of this invention, the RNA virus is a vaccine strain 3 poliovirus and the host is a primary monkey kidney cell.

A method of screening for variants of a strain 3 poliovirus is provided by this invention which comprises the steps of isolating genomic RNA from the poliovirus, and employing RNA sequencing means to determine the nucleotide at position 2493.

This invention also provides a method for increasing the attenuation of a strain 3 poliovirus encoded by an RNA virus cDNA, wherein the cDNA comprises the nucleotide sequence ATT at positions 2492 to 2494, and the method comprising the step of mutagenizing the cDNA at nucleotide 2493 to change the T to C. This method may further comprise the step of mutagenizing the cDNA at nucleotide 2494 to change the T to a nucleotide selected from the group consisting of A, C and G.

The determination of phenotypic differences may be carried out by several methods which are well known in the art. Preferably, a true RNA virus cDNA of an attenuated strain 3 poliovirus encodes a virus which has the same degree of attenuation as the source virus. Several well characterized markers can be used to determine phenotypic changes in a strain of poliovirus. These are the "d" markers, which regulate the ability of the virus to grow under acid conditions [M Vogt et al., "Mutants of Poliomyelitis Viruses with Reduced Efficiency of Plating in Acid Medium and Reduced Neuropathogenicity", Virology, 4, pp 141-55 (1957)]; and the "rct₄₀ " marker, which regulates the ability of the virus to grow at elevated temperatures [A. Lwoff, "Factors Influencing the Evolution of Viral Diseases at the Cellular Level and in the Organism", Bact. Rev., 23 pp. 109-24 (1959)].

The most preferred method of determining phenotypic changes between an attenuated strain 3 source poliovirus and the virus produced from its true RNA virus cDNA is a comparison of neurovirulence. Several protocols for assessing neurovirulence are known in the art [Code of Federal Regulations, Title 21, Chapter 1, pp. 91-93 (Apr. 1, 1987 edition)].

Production of a True RNA Virus cDNA

According to one embodiment, the present invention relates to a method for producing true RNA virus cDNA. The production of a true RNA virus cDNA according to this invention may employ any RNA source virus--positive single-stranded, negative single-stranded, or double-stranded. Preferably, the source virus is a positive single-stranded virus. More preferably, the virus is a human enterovirus belonging to the family Picornaviridae. Most preferred is an attenuated type 3 vaccine strain poliovirus (also referred to herein as "P3/Sabin").

A. Proliferation and Isolation of Virus

The initial step in the production of true RNA virus cDNA involves the proliferation and purification of the source virus and the isolation of RNA therefrom. Techniques for proliferating and isolating virus are well known in the art [R. J. Kuchler, "Biochemical Methods in Cell Culture and Virology", Dowden Hutchinson and Ross, Inc., Stroudsburg, Pa. (1977)]. It will be understood that the method of viral growth, including choice of host cell, selection of growth medium, and conditions of growth, will differ depending upon the particular source virus. In a preferred embodiment, attenuated strain 3 poliovirus is proliferated in primary monkey kidney cells according to known methods [A. Sabin et al., "Studies On Variants Of Poliomyelitis Virus", J. Exp., Med., 99, pp. 551-76 (1954)]. The following procedures are applicable for any strain of poliovirus. However, the use of an alternate RNA source virus in these procedures may require certain virus-specific modifications which are known to those of skill in the art.

Viral proliferation of the RNA virus is allowed to proceed to a point where a sufficient quantity of virus can be harvested for RNA isolation. The culture media containing the source virus is then collected and cellular debris is removed, preferably by centrifugation at a speed which will not pellet the virus. This is typically about 2,500 rpm for about 20 minutes. The supernatant may then be further purified by ultrafiltration employing a filter having a pore size that is larger than the viral particles. Preferably, a filter of approximately 0.22 μM is used.

Following filtration, the viral particles are collected by polyethylene glycol precipitation followed by centrifugation or, more preferably, by high speed centrifugation at about 70,000 rpm. The viral particles are then resuspended in a small volume of buffer, preferably TNE (10 mM Tris-HCl, 100 mM NaCl, 1 mM EDTA, pH 7.4). A non-ionic detergent may optionally be added to the viral particle suspension to dissolve any contaminants. Although the high speed viral pellet is sufficiently pure to use as a source of viral RNA the viral suspension may optionally be further purified by sucrose density gradient centrifugation.

If density gradient centrifugation is employed, fractions are collected from the gradient and analyzed for the presence of source virus. Any conventional assay which detects source virus-specific proteins may be employed. Such assays include, for example, Western blots, ELISA, radioimmunoassay, or polyacrylamide gel electrophoresis and comparison to a source virus standard. The latter technique is most preferred because it is the most economical.

B. Isolation and Sequencing of Viral RNA

Once the virus is purified as described above, viral RNA may then be isolated. This is achieved by first dissociating the viral capsid proteins by treatment with detergent, preferably sodium dodecyl sulfate ("SDS") at a final concentration of 0.5%. The dissociated proteins are then extracted by treatment of the sample with organic solvents. Extraction is preferably achieved with a phenol:chloroform:isoamyl alcohol mixture. The RNA present in the aqueous phase may then be isolated by any method well known in the art [T. Maniatis, "The Molecular Guide To Cloning", Cold Spring Harbor Press (1983)]. Preferably, the viral RNA is precipitated with 0.5 volumes of 7.5M ammonium acetate and 2.5 volumes of ethanol at -20° C. Quantitation and integrity of the viral RNA may be determined by agarose gel electrophoresis. It should be noted, as is well known in the molecular biology art, that great care must be taken in the preparation and handling of RNA samples due to the prevalence of RNases. Methods for inactivating RNases that may be present in reagents and in vessels used in RNA preparation are well known [T. Maniatis, supra].

Once the source virus RNA has been isolated, it is subjected to dideoxy nucleotide sequencing [F. Sanger et al., "DNA Sequencing With Chain-Terminating Inhibitors", Proc. Natl. Acad. Sci. USA, 74:5463-67 (1977)] employing modifications for RNA [D. C. Deborde et al., "Resolution Of A Common RNA Sequencing Ambiguity By Terminal Deoxynucleotidyl Transferase", Anal. Biochem., 157, pp. 275-82 (1986)]. According to a preferred embodiment of this invention, the entire RNA genome of the virus is sequenced and compared to the cDNA sequence by methods which are hereinafter described. In this embodiment of the invention the source virus is most preferably an attenuated type 3 vaccine strain poliovirus.

C. Synthesis and Screening of an RNA Virus cDNA Library

Following RNA sequencing, the source virus RNA is used as a template for the synthesis of a full-length, double-stranded cDNA. Any well-known method or commercially available cDNA synthesis kit is employed to synthesize cDNA. Preferably, cDNA is synthesized by the method of V. R. Racaniello et al., "Molecular Cloning Of Poliovirus cDNA And Determination Of The Complete Nucleotide Sequence Of The Viral Genome", Proc. Natl. Acad. Sci. USA, 78, pp. 4887-91 (1981). For ease of detection, the first strands of cDNA may optionally be radiolabeled by employing a radioactive nucleotide during cDNA synthesis. Once synthesized, the single-stranded cDNA are preferably size-fractionated either by agarose gel electrophoresis or, more preferably, by gel chromatography. The larger cDNAs are isolated and used as templates for second-strand synthesis. Double-stranded cDNA is then size-fractionated as described above and the largest molecules are used for the creation of a source virus cDNA library. The cDNA are then tailed, either by the olio dG/dC method or by the addition of restriction enzyme linkers, and cloned into an appropriate vector. The choice of vector will be based upon the technique that will be employed to screen the library. For example, the use of an immunoscreening technique requires that the cDNA be inserted into an expression vector, such as lambda gt11 (ATCC accession number 37194). If a hybridization screening method is employed, vectors such as bacterial plasmids are most convenient. Preferably, the cDNA are tailed by the olio dG/dC method and cloned into the PstI site of pBR322.

Once the RNA virus cDNA library is created, it is screened for a full-length cDNA clone. This may be achieved by well-known screening methods, such as antibody screening or hybridization to a labeled probe. Most preferably, the library is screened by colony hybridization using virus-specific cDNA probes [M. Grunstein et al., "Colony Hybridization: A Method for the Isolation of Cloned DNAs That Contain A Specific Gene", Proc. Natl. Acad. Sci. USA, 72, pp. 3961-65 (1975)], based on known nucleotide sequences or amino acid sequences of the virus. Once a clone containing an RNA virus cDNA has been identified and isolated, it may be removed from the vector and analyzed to determine whether it represents a full-length RNA virus cDNA. In a preferred embodiment of the invention, the dC-tailed cDNA is removed from a Pst I cut, dG-tailed vector by digestion with PstI. Partial cDNAs may be used to reprobe the library and to locate longer, or full-length cDNAs. If no full-length cDNAs can be detected, several overlapping partial cDNAs representing the entire source virus genome may be ligated together at common restriction sites to produce a full-length cDNA [V. R. Racaniello et al., "Cloned Poliovirus cDNA Is Infections In Mammalian Cells", Science, 214, pp. 916-19 (1981)]. Any portion of the viral genome which is not represented by an isolated cDNA may be synthesized using standard oligonucleotide synthesizing techniques and subsequently ligated into its proper position to form a full-length cDNA.

D. Sequencing and Alteration of cDNA to Correspond to Source Virus RNA

Portions of the full-length cDNA corresponding to the sequenced portion of the source virus RNA are then sequenced by standard DNA sequencing methods. The sequenced regions of the viral RNA and the RNA virus cDNA are then compared. Theoretically, the cDNA should correspond exactly to the RNA which served as its template. However, it is known that reverse transcriptase can produce errors when transcribing cDNA from RNA [I. M. Verma, "Reverse Transcriptase", In The Enzymes, Vol. 14, P. D. Boyer, ed., Academic Press, New York, pp. 87-104 (1981)]. Therefore, according to the method of this invention, it is necessary to alter the nucleotide sequence of the cDNA so that it corresponds to the sequenced RNA.

The present invention contemplates altering the cDNA sequence at those sites which are responsible for phenotypic changes. Accordingly, portions of the cDNA which encode polypeptides having equivalent amino acid sequences as those encoded by the source virus RNA need not be altered.

Preferably, any cDNA nucleotide mutation which may potentially affect virus production or viral polypeptide synthesis should be altered to correspond to the source virus RNA.

Methods for altering the nucleotide sequence of a cDNA molecule are known in the art and include site-directed mutagenesis [C. A. Hutchinson, III et al., "Mutagenesis At A Specific Position In A DNA Sequence", J. Bio. Chem., 253, pp 6551-60 (1978); A. Razin et al., "Efficient Correction Of A Mutation By Use Of A Chemically Synthesized DNA", Proc. Natl. Acad. Sci., USA, 75, p. 4268 (1978)]. Alternatively, a partial cDNA clone containing the desired sequence may be isolated from the cDNA library and its DNA, or a portion thereof, substituted in the full-length clone for the sequences which are to be altered. Once the cDNA sequence has been altered, it is utilized in other embodiments of this invention.

According to another embodiment of the present invention, the true RNA virus cDNA may be inserted into an appropriate vector and used to transform an appropriate host. The choice of vector will depend upon the ultimate intended use of the cDNA. Similarly, the choice of host will depend upon both the vector selected and the ultimate goal of transformation.

For example, if it is desirable to simply store the cDNA and create and unlimited supply thereof, the cDNA will be inserted into a vector which is capable of transforming a unicellular organism, such as a bacteria, yeast or other fungi, an animal cell or an insect cell. According to one embodiment of this invention, the cDNA is inserted into the PstI site of pBR322 and the host to be transformed is E. coli.

According to another embodiment of the invention, the true RNA virus cDNA may be operatively linked to a promoter of transcription. As used herein, the term "operatively linked" means positioned in such a manner that the promoter will direct the transcription of RNA off of the true virus cDNA. Examples of such promoters are SP6, T4 and T7. The most preferred promoter is the T7 promoter [J. J. Dunn et al., "Complete Nucleotide Sequence of Bacteriophage T7 DNA and the Locations of T7 Genetic Elements", J. Mol. Biol., 166, pp. 477-535 (1983)]. Vectors which contain both a promoter and a cloning site into which an inserted piece of DNA is operatively linked to that promoter are well known in the art. Preferably, these vectors are capable of transcribing RNA in vitro. Examples of such vectors are the pGEM series [Promega Biotec, Madison, Wis.].

According to a further embodiment, the present invention relates to methods for producing viable positive stranded RNA virus. This may be achieved by transfecting an appropriate host with a true RNA virus cDNA. Any standard method of transfecting animal cells with DNA may be employed [F. M. Ausubel et al., "Current Protocols in Molecular Biology", Greene Publishing Associates & Wiley Intersciences (1987)]. The host is then cultured under conditions conductive to the production of RNA virus. Such conditions are well known in the art and will vary depending upon the virus to be produced. Similarly, the choice of host cell should be one which is compatible with the virus, preferably primate cells in culture. In a preferred embodiment of the invention, wherein the true RNA virus cDNA encodes an attenuated strain 3 poliovirus, the host is selected from the group consisting of Vero cells, HeLa cells, COS cells, CV-1 cells, human diploid cell lines, such as WI-38 and MRC5, and primary monkey kidney cells. The most preferred hosts are monkey kidney cells. Once the cells have produced a desirable level of virus, the virus is harvested from the cell culture according to standard protocols.

According to an alternative embodiment of this invention, viral RNA, which is produced by in vitro transcription of a true RNA virus cDNA according to the invention, may be employed in methods for producing viable RNA virus. The in vitro transcribed RNA is isolated by standard methods and used to transfect an appropriate host. The use of RNA to transfect cells is known in the art [S. van der Werf et al., "Synthesis of Infectious Poliovirus RNA by Purified T7 RNA Polymerase", Proc. Natl. Acad. Sci. USA, 83, pp. 2330-34 (1986)]. Once the cells are transfected, they are grown and the virus harvested according to standard protocols.

According to another embodiment, this invention relates to a method of screening for variants of a strain 3 poliovirus. Through RNA sequencing, it was discovered that the presence of a C at nucleotide position 2493 of this virus is linked to the attenuated strain 3 genotype. Previous analyses of this strain failed to recognize the importance of this position in attenuation due to a combination of two factors: the sequences of wild-type and attenuated strain 3 poliovirus were compared on the cDNA level, rather than the genomic RNA level; and some of these cDNAs were made from plaque isolates of the original viral samples [Stanway et al., "Comparison of the Complete Nucleotide Sequences of the Genomes of the Neurovirulent Poliovirus P3/Leon/37 and its Attenuated Sabin Vaccine Derivative P3/Leon 12a1b", Proc. Natl. Acad. Sci. USA, 81, pp. 1539-43 (1984); H. Toyoda et al., "Complete Nucleotide Sequences of All Three Poliovirus Serotype Genomes", J. Mol. Biol., 174, pp. 561-85 (1984)]. As a result of either errors in reverse transcription or mutations resulting from virus passaging, the cDNAs previously produced contained a T at position 2493. Therefore, the RNA of type 3 vaccine strain of poliovirus was mistakenly believed to contain U at this position, the same nucleotide present in the wild-type strain [Stanway et al., supra; H. Toyoda et al., Supra]. Accordingly, position 2493 was never thought to contribute to the attenuation of the poliovirus strain 3 genome.

Therefore, the method of screening for variants of a strain 3 poliovirus according to this invention comprises the steps of sequencing the RNA of the virus and determining the nucleotide at position 2493. Most preferably, the portion of RNA to be sequenced consists of about 100-200 nucleotides flanking nucleotide 2493. This method may be used during amplification of the source virus (e.g., in vaccine production) to ensure maintenance of C at position 2493 in the viral genome.

This invention also relates to a method for increasing the attenuation of a strain 3 poliovirus produced from a true RNA virus cDNA, wherein the cDNA comprises the sequence ATT at positions 2492-2494. The method comprises mutagenizing the nucleotide at position 2493 from a T to a C and subsequently using the mutagenized cDNA to produce viable virus. The presence of a C instead of a U at position 2493 of strain 3 poliovirus RNA would be expected to alter the sixth amino acid of the viral capsid protein, VP1, from isoleucine to threonine, based on the genetic code. The condon encoding this amino acid spans nucleotides 2492-2494. Therefore, the method for increasing the attenuation of strain 3 poliovirus according to this invention may also include mutagenizing nucleotide 2494 from a T to either A, C or G.

In order that the invention described herein may be more fully understood, the following examples are set forth. It should be understood that these examples are for illustrative purposes only and are not to be construed as limiting this invention in any manner.

EXAMPLE 1 Purification Of A Strain 3 Poliovirus

All glassware utilized in the methods described below is either sterilized or treated with diethyl pyrocarbonate (DEP) to destroy RNases. All reagents are made up with water that had been treated with DEP prior to use.

Primary monkey kidney cells are infected with an attenuated strain 3 poliovirus, isolated by plaque purification from the "ORIMUNE" vaccine (Lederle Laboratories, Pearl River, N.Y.), at a low multiplicity of infection ("MOI"). The infected cultures are maintained at 34° C. in modified Earle's lacteal maintenance medium, pH 7.3, until the cell monolayer is destroyed (+4 cytopathic effect ("CPE")). The culture media (120 ml) is collected and centrifuged at 2,500 rpm for 20 minutes to remove any cellular debris. The supernatant is then filtered through a 0.22 μM Millex GV disc filter. The filtrate is then placed in quick seal high speed centrifuge tubes and spun in a 70.1 Ti rotor at 70,000 rpm for one hour at 4° C.

The virus pellet is resuspended in 4 ml of RNase-free TNE (10 mM Tris-HCl, 100 mM NaCl, 1 mM EDTA, pH 7.4) and the suspension transferred to a 15 ml polypropylene tube. The viral capsid are then dissociated by the addition of RNase-free SDS to a final concentration of 0.5%.

EXAMPLE 2 Isolation And Sequencing of Viral RNA

Viral RNA is then isolated by extracting the dissolved capsid as prepared in Example 1, with an equal volume of phenol:chloroform:isoamyl alcohol (25:24:1). The aqueous layer is removed and re-extracted with the same organic solution. One-half volume of 7.5M ammonium acetate and 2.5 volumes of 100% ethanol is added to the aqueous extract and the RNA is precipitated at -20° C. for at least 30 minutes. The RNA is then pelleted by centrifugation at 12,000 rpm for 30 minutes. The RNA pellet is washed twice with ice-cold 70% ethanol, dried under vacuum and resuspended in 20 μM of water. The integrity and concentration of the RNA is estimated by using agarose gel electrophoresis.

The viral RNA is then sequenced essentially by the method of DeBorde [D. C. Deborde et al., Anal. Biochem., 157, pp. 275-82 (1986)], the disclosure of which is incorporated herein by reference. The specific details of sequencing are described below.

Approximately 500 mg of purified viral RNA is heat denatured at 100° C. for 3 minutes in 200 mM Tris-HCl, pH 8.3, 200 mM KCl, 20 mM MgCl₂, 10 mM DTT and then quick-chilled by immersion into an ice bath. Primer, dATP and enzymes are then mixed in with the RNA, as described by DeBorde et al. Two and one-half μl of primer-RNA mix is then combined with an equal volume of various reaction mixes, in separate tubes, to give the following component concentrations:

Tube A: 50 mM Tris-HCl, pH 8.3, 50 mM KCl, 5 mM MgCl, 10 mM DTT, 100 μM each of dCTP, dGTP and dTTP, 5 μCi [³⁵ S]-dATP, 1.25 μM ddATP, 100 ng RNA, 15 ng primer and 2.8 units reverse transcriptase;

Tube C: same as tube A, except 20 μM dCTP, 2.5 μM ddCTP and no ddATP;

Tube G: same as tube A, except 20 μM dGTP, 3.0 μM ddGTP and no ddATP;

Tube T: same as tube A, except 20 μM dTTP, 7.5 μM ddTTP and no ddATP;

Tube N: same as tube A, except no ddATP.

Each tube is incubated at 42° C. for 20 minutes. Following this incubation, 1 μl of chase solution (1 mM each of dATP, dCTP, dGTP and dTTP and 2 units of terminal deoxynucleotidyl transferase) is added to each tube and the tubes incubated for another 30 minutes at 37° C. The reactions are stopped by freezing at -20° C. Prior to electrophoresis, 5 μl of formamide dye mixture is added to each tube. The samples are then heated to 100° C. for 3 minutes and 5 μl of each sample is loaded per gel lane.

The sequencing gels (35 cm×13 cm×0.1 mm) are 6% polyacrylamide (38:2; acrylamide:bis-acrylamide) containing 7M urea in TBE (89 mM Tris, 89 mM boric acid, 2 mM EDTA).

When the complete sequence of the attenuated strain 3 poliovirus RNA genome is compared to the published P3/Sabin cDNA sequence [Stanway et al., Proc. Natl. Acad. Sci. USA, 81, pp. 1539-43 (1984)], two nucleotide differences are observed, one of which causes an amino acid change. These are shown in the following table:

    ______________________________________                                         Nucleotide                                                                               P3/Sabin    P3/Sabin Amino Acid                                      Position  Stanway     RNA      Change                                          ______________________________________                                         2493      T           C        Ile to Thr                                                                     (VPI)                                           6061      C           U(T)     silent                                          ______________________________________                                    

The difference observed at position 2493 is important for several reasons. First, it encodes a significant amino acid change in the viral capsid protein VP1. Moreover, because wild-type strain 3 poliovirus has a U at this position (T in the cDNA), the purported presence of a T at 2493 of the attenuated strain 3 poliovirus cDNA may have obscured a significant difference between the wild-type and attenuated genomes.

EXAMPLE 3 Synthesis Of A Full-Length Poliovirus cDNA

The vital RNA obtained in the previous Example is used as a template for the synthesis of a double-stranded cDNA using the method of V. R. Racaniello et al., "Molecular Cloning Of Poliovirus cDNA And Determination Of The Complete Nucleotide Sequence Of The Viral Genome", Proc. Natl. Acad. Sci. USA, 78, pp. 4887-91 (1981). Specifically, 2.5 μg of vital RNA is used to produce first strand cDNA in a 100 μl reaction containing 50 mM Tris-HCl, pH 8.3, 10 mM MgCl₂, 50 mM KCl, 0.5 mM each of dATP, dTTP, dGTP and dCTP, 0.4 mM DTE, 30 μg/ml olio dT (12-18 nucleotides in length), 4 mM sodium pyrophosphate, 10 μCi/μl [α-³² P]-dATP, 1 unit/μl RNasin and 2 units/μl reverse transcriptase [Boehringer-Mannheim, Indianapolis, Ind.]. The reaction is incubated at 42° C. for 60 minutes. The reaction is stopped by the addition of EDTA to a final concentration of 50 μM. The solution is then phenol extracted and the aqueous layer applied to a 5×0.7 cm Sepharose CL-4B column. The column is developed with 0.3M NaCl, 10 mM Tris-HCl, pH 8.0, 1 mM EDTA. The profile thus obtained is depicted in FIG. 1.

Fractions 5-13 are pooled from the column and 20 μg of glycogen is added thereto. The cDNA is then precipitated at -20° C. by the addition of 2.5 volumes of ethanol, pelleted by centrifugation and resuspended in 25 μl of 10 mM Tris-HCl, pH 8.0, 1 mM EDTA (TE) for use in second strand synthesis.

Second strand synthesis is performed in a 100 μl reaction containing the single-stranded cDNA, 20 mM Tris-HCl, pH 7.4, 7 mM MgCl₂, 0.1M KCl, 50 μg/ml bovine serum albumin (BSA), 0.1 mM of each dNTP, 150 μM β-NAD, 5 μg/ml E. coli DNA ligase, 250 units/ml E. coli DNA polymerase I and 90 units/ml RNase H. The mixture is incubated at 15° C. for 60 minutes followed by 90 minutes at room temperature. The cDNA is then phenol extracted and chromatographed over Sepharose CL-4B as described above. The void volume fractions are pooled, precipitated with ethanol and resuspended in 25 μl TE, as described previously.

The resulting double-stranded cDNA is tailed with deoxycytidine (dC) in a 100 μl reaction containing 140 mM K-cacodylate, pH 7.2, 30mM Tris-HCl, 1mM CoCl₂, 1 mM DTT, 50 μg/ml BSA, 150 μM dCTP and 800 units/ml terminal deoxynucleotidyl transferase. The reaction is incubated at 37° C. for 60 minutes and the solution is then phenol extracted. The aqueous layer is removed, ether extracted and the tailed cDNA is precipitated therefrom with ethanol containing 20 μg glycogen. The resulting pellet is suspended in 50 μl of 10 mM Tris-HCl, pH 7.5, 100 mM NaCl, 1 mM EDTA (NTE).

EXAMPLE 4 Production Of A Poliovirus cDNA Library

Increasing amounts (1, 2, 5, 10 μl) of dC-tailed, double-stranded cDNA, produced as in Example 3, are annealed to 10 ng of PstI-cleaved, dG-tailed pUC9 (Pharmacia, Piscataway, N.J.) in NTE by heating the mixture to 68° C. in a water bath for 5 minutes, cooling the mixture in a 42° C. water bath and then slowly decreasing the temperature to room temperature overnight by shutting off the water bath. This allows for optimal hybridization between the dC tails on the cDNA and the dG tails on pUC9. The annealed mixtures are then used to transform E. coli DH5α cells (Bethesda Research Labs, Gaithersburg, Md.) using standard procedures. Ampicillin-resistant colonies are selected for screening.

EXAMPLE 5 Isolation Of Poliovirus Clones

Bacterial colonies are picked onto gridded plates and screened by colony hybridization, using linearized plasmid pOLIO (Sabin) as a probe [J. W. Almond et al., "Attenuation and Reversion to Neurovirulence of the Sabin Poliovirus Type-3 Vaccine", In Vaccines, 85, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y., pp. 271-77 (1985)]. Plasmid pOLIO (Sabin) is a plasmid containing a full-length cDNA derived from P3/Leon 12a₁ b [G. Stanway et al., Proc. Natl. Acad. Sci., USA, 81, pp. 1539-43 (1984)]. Of the 600 colonies screened by hybridization, 140 give positive hybridization signals. Small cultures (5 ml; LB media+ampicillin) of each of these positive clones are prepared and the plasmid DNA is isolated by the rapid boiling method [T. Maniatis et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory (1982)]. The isolated plasmids are cleaved with PstI, which is predicted to excise the cloned insert. Of the 140 clones which are analyzed in this way, the vast majority contain either small inserts (<1 kb) or do not release inserts upon cleavage with Pst1. We are, however, able to identify and map three cDNA clones to the P3/Sabin genome (pLL3-51, 69 and 82; see FIG. 2). Eight hundred colonies are screened by colony hybridization using anEcoRI fragment of pOLIO(Sabin) cDNA, which contains the 5'-most 747 nucleotides, as a probe. Of the thirty-three colonies that hybridize to the probe, five of the cDNAs represent nucleotides 85-779 of P2/Sabin. The origin of these cDNA clones is not known.

Subsequently, additional cDNA is annealed to pUC9 and transformed into DH5α cells. Eight hundred colonies are screened by colony hybridization using full-length pOLIO (Sabin) cDNA as a probe. Twenty-nine positive clones are analyzed by restriction enzyme cleavage. Eight of the positive clones that contain cDNA inserts are subjected to nucleotide sequencing from either end of the inserted DNA. Five of these clones (pLL3-239, 251, 253, 254 and 255) contains a cDNA that maps to the 3'-end of the P3/Sabin genome (FIG. 2).

As depicted in FIG. 2 (open bars), the 8 isolated and analyzed cDNA clones represent nucleotidase 3630 through the 3'-end of the P3/Sabin viral RNA genome.

To obtain additional 5'-cDNA clones, an oligonucleotide complementary to a stretch of bases from nucleotide 4317-4334 (see black shading in FIG. 2, pLL3-253) is synthesized. This oligonucleotide is used to prime cDNA synthesis on P3/Sabin viral RNA isolated according to Example 2 using cDNA synthesis methods described in Example 3. The resulting cDNAs are dc-tailed and cloned into PstI-cut, dG-tailed pUC9. The resulting plasmids are then used to transform DH5α cells. Transformants are screened by hybridization to pOLIO (Sabin 3). Eleven positive cDNA clones are isolated, analyzed by restriction enzyme digestion and sequenced (FIG. 2, black boxes). The eleven clones correspond to nucleotides 6 through 4329 of the P3/Sabin viral RNA.

Although cDNA clones representing all but the first 5 nucleotides of the P3/Sabin genome are identified, attempts to ligate together several of the 5'-cDNAs into a single cDNA proved difficult, due to the lack of convenient restriction sites. Therefore, one final round of cDNA cloning is undertaken in an attempt to obtain a single cDNA clone that represented approximately nucleotides 1 through 1900.

An oligonucleotide complementary to bases 1904-1922 is synthesized and used to prime cDNA synthesis on P3/Sabin RNA. The cDNA is then dC-tailed, inserted into dG-tailed, PstI-cut pUC9 and the resulting plasmid is used to transform DH5α cells. The transformants are again screened with pOLIO (Sabin 3). One of the resulting cDNA clones is identified as containing nucleotides 9-1900 of the viral RNA (FIG. 2, pLL3-307). This completes the cDNA cloning of the P3/Sabin genome.

EXAMPLE 6 Construction Of A Full-Length Poliovirus cDNA

The strategy for assembling the P3/Sabin cDNAs into a single full length cDNA is depicted in FIG. 3. First, clones pLL3-253 and pLL3-271 are assembled into a single clone by using a common HindIII site at nucleotide 4241. The resulting cDNA clone, which represents nucleotides 1500 through the 3' poly(A) of the genome, is called pLL3-I.

Next, the first 5 nucleotides of the P3/Sabin viral genome, which are missing from cDNA pLL3-277, are synthesized. This is achieved by synthesizing two complementary oligonucleotides using an automated oligonucleotide synthesizer and then annealing the oligonucleotides to one another. The resulting double-stranded oligonucleotide comprising from 5' to 3', a PstI site, nucleotides 1 through 34 of P3/Sabin cDNA and a BgeI sticky end, is then used to replace the 5'- most PstI/BglI fragment of pLL3-277. This results in a cDNA clone that represents nucleotides 1-1085. This clone is called pLL3-II.

The final step is the ligation of three SacI fragments representing nucleotides 1-747, 747-1895 and 1895 through the 3'-end, to form a full length P3/Sabin cDNA (termed pLL3-FL). This construction is performed in several separate steps using appropriate cDNA inserts from pLL3-II, pLL3-307 and pLL3-I.

EXAMPLE 7 Sequence Analysis Of The P3/Sabin cDNA

Sequence analysis of P3/Sabin cDNAs used to construct the full length cDNA is performed using three approaches. In the first approach, cDNAs are subcloned into M13 vectors. Nested deletions of the subcloned cDNA, using exonuclease III and mung bean nuclease [E. Ozkaynak et al., "A Unidirectional Deletion Technique for the Generation of Clones for Sequencing, Biotechniques, 5, pp. 770-73 (1987)], are then isolated and their nucleotide sequences determined.

In the second approach, cDNAs are isolated and digested with the RsaI. The resulting fragments are then "shotgunned" into M13 for sequence analysis. The last approach utilizes cDNA fragments that are isolated from clones and subcloned into M13 for sequence analysis.

Upon nucleotide sequencing of the entire P3/Sabin cDNA clone, 3 differences are found when the cDNA sequence is compared to the RNA sequence. These differences are summarized in the following table:

    ______________________________________                                         Nucleotide                                                                               P3/Sabin   P3/Sabin Amino Acid                                       Position  cDNA       RNA      Change                                           ______________________________________                                         198       G          A        5' untranslated                                  4466      T          C        His to Tyr (2C)                                  6334      T          C        silent                                           ______________________________________                                    

The nucleotide error at position 198 in the cDNA is situated in the 5' untranslated region of the genome. The function of that region remains unknown. The presence of a T, not a C, in the cDNA at position 4466 would create a change in residue 118 of the 2C protein from histidine to tyrosine. Although the function of the 2C protein remains undefined, the predicted amino acid change appears to be significant. The nucleotide change at position 6334 is in the polymerase gene, but is silent with respect to amino acid change.

In order to obtain a full-length cDNA clone which corresponds exactly to the poliovirus RNA sequence, the protocol depicted in FIG. 4, panels A and B is followed. Specifically, the P3/Sabin cDNA insert pLL3-II is subcloned into the PstI site of bacteriophage M13 for oligonucleotide-directed mutagenesis of nucleotide 198 from G to A as described below. An oligonucleotide spanning nucleotides 190-206 and containing an A at position 198 is synthesized for this purpose. The oligonucleotide has the sequence: 3'-GGCGTATCTGACAAGGG-5' (SEQ ID NO:4).

Mutagenesis is performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. The mutagenized insert is called pLL3-II(A). Following mutagenesis, pLL3-II(A) is sequenced to confirm the presence of an A at position 198. pLL3-II(A) is then removed from M13 with PstI and subsequently digested with SacI, which cuts at nucleotide 747. The fragment spanning nucleotides 1-747 is used to replace the corresponding fragment in pLL3-FL. The resulting full-length cDNA containing the mutagenized nucleotide at position 198 is referred to as pLL3-FL(A).

Next, pLL3-FL(A) is digested with PstI and the fragment spanning nucleotide 1 to 2604 is isolated. A portion of the isolated PstI fragment of pLL3-FL(A)--nucleotides 1 to 1922--is then subjected to polymerase chain reaction (PCR) using the following two oligonucleotides as primers:

    1) 5'-CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTG-3' (SEQ ID NO:5); and

    2) 5'-GAATCATGGTGTCTATCTC-3' (SEQ ID NO:6).

Oligonucleotide 1 represents, in a 5'-to-3' orientation, a PstI site, the T7 promoter and nucleotides 1-20 of the positive strand of P3/Sabin cDNA. Oligonucleotide 2 represents nucleotides 1904-1922 of the negative strand of P3/Sabin cDNA. PCR is performed using the GeneAMp kit from Perkin-Elmer Cetus.

The double-stranded DNA produced by PCR is then cleaved with EcoRI, which cuts at nucleotide 784 of P3/Sabin cDNA. The resulting 0.8 kb fragment representing the T7 promoter and the 5' end of P3/Sabin cDNA is isolated and subcloned into the SspI/EcoRI-cut pBR322. The resulting plasmid, termed pVR309, is confirmed to contain the T7 promoter and nucleotides 1-784 of P3/Sabin cDNA.

Next, pLL3-FL is cut with EcoRI, which cuts at nucleotides 784 and 2867. The fragment spanning these nucleotides is isolated and ligated to EcoRI cut PVR309. The resulting plasmid, pVR312, contains the first 2867 nucleotides of P3/Sabin cDNA.

In order to change the nucleotides at 4466 and 6334 to correspond to the viral RNA sequence, oligonucleotide-directed mutagenesis is performed as described above. The insert pLL3-253 is subcloned into the PstI site of M13. Oligonucleotide 3'-CTCGTTTGTGGCATAAC-5' (SEQ ID NO:7) is used to change nucleotide 4466 from a T to a C. Oligonucleotide 3'-CCCATGGGGATGCACCG-5' (SEQ ID NO:8) is used to change nucleotide 6334 from a T to a C. The mutagenized nucleotides are confirmed by nucleotide sequencing. The resulting corrected cDNA fragment termed pLL3-253(C).

Next, pLL3-253(C) and pLL3-271 are assembled into a single clone by cleaving each with HindIII and subsequently ligating the resulting large fragments together. The resulting DNA, referred to as pLL3-I(C), represents nucleotides 1500 through the 3-poly(A) end of P3/Sabin. The complete cDNA insert of pLL3-I(C) is then isolated by partial PstI digestion. A partial digest is necessitated because of the presence of a PstI site at nucleotide 2604. pLL3-I(C) is then ligated to PstI cut pBR322. A portion of that insert is then removed from pBR322 by a SmaI/NruI digest. The excised piece spans nucleotide 2766 through the 3'-end of the cDNA and into pBR322. A SmaI/NruI fragment is also excised from pVR312. This DNA represents sequences from the NruI site in pBR322 through the T7 promoter and the 5'-end of P3/Sabin cDNA and up to the SmaI site at nucleotide 2766. These two SmaI/NruI fragments are then ligated together to form a true P3/Sabin cDNA. The resulting plasmid is called pVR318. The presence of a full-length insert in pVR318 is confirmed by restriction enzyme analysis. However, upon sequencing of pVR318 it was determined that position 2493 was found to have a T, rather than a C.

In order to convert nucleotide 2493 to a C, a SacI/HindIII restriction fragment spanning nucleotides 1895 through 4241 from PVR318 is removed and replaced with a corresponding fragment from subclone pLL3-271. This scheme is depicted in FIG. 5. Specifically, pLL3-271 is digested with SacI and Hind III and the 2.346 kb fragment spanning nucleotides 1895 through 4241 is isolated by agarose gel electrophoresis. Plasmid PVR318 is partially digested with SacI and HindIII to remove the 2.346 kb fragment between nucleotides 1895 and 4241. The remaining 10.007 kb fragment is isolated by agarose gel electrophoresis and then ligated to the SacI/HindIII fragment from pLL3-271. The resulting ligation product, pLED3, represents a true full-length type 3 poliovirus vaccine strain cDNA. The sequence of pLED3 is depicted in FIGS. 6A-6K.

EXAMPLE 8 Transfection of Cells With Poliovirus P3/Sabin cDNA

Primary monkey kidney cells are grown to 80% confluency in duplicate 25 cm² flasks in Eagle's basal medium (BME) containing Hank's balanced salt solution, 0.35% bicarbonate and 10% calf serum. The cultures are maintained at 37° C. The medium is then removed and the cells transfected with pLED3 (prepared as in Example 7), added as a calcium phosphate precipitate in HEPES-buffered saline. After 20 minutes at room temperature, the cells are covered with Earle's lacteal maintenance medium and incubated for 4 hours at 37° C. The medium is then removed and the cells are washed once with fresh, warm medium. Two milliliters of 15% glycerol in HEPES-buffered saline are then added to the cells and incubation is continued for 3.5 minutes at 37° C. The glycerol is then removed and the cells are washed once again with fresh medium. One of the duplicate cultures is then covered with warm medium containing 1% Noble agar (Difco, Detroit, Mich.), while the other is covered with agar-free medium. The cultures are incubated at 34° C. for 1-5 days. Plaques are visualized by staining cells with 0.01% neutral red. Medium from the liquid culture is assayed for infectious poliovirus by plaque titration on Vero cells.

EXAMPLE 9 In Vitro Transcription of P3/Sabin cDNA

In vitro transcription is performed by the method of van der Werf et al., Proc. Natl. Acad. Sci. USA, 83, pp. 2330-34 (1986). pLED3, containing the T7 promoter-P3/Sabin construct, is linearized at an appropriate restriction site outside of the poliovirus sequences and purified by phenol extraction and ethanol precipitation. This template DNA is then added to a mixture containing 20 mM sodium phosphate, pH 7.7, 8 mM MgCl₂, 10 mM DTT, 1 mM spermidine-HCl, 50 mM NaCl and 1 mM each of dATP, dCTP, dGTP and dUTP. RNA synthesis is initiated by the addition of 10-15 units of T7 RNA polymerase/μg linearized template and allowed to continue for 30 minutes at 37° C.

Following RNA synthesis, the DNA template is digested away with Dnase I. The remaining RNA is purified by phenol extraction and ethanol precipitation in the presence of 2.5M ammonium acetate. The purified RNA is quantified by UV spectrophotometry.

EXAMPLE 10 Transfection Of Cells With In Vitro Transcribed P31/Sabin RNA

Semi-confluent monolayers of primary monkey kidney cell are prepared as described in Example 8. The cells are transfected with the RNA prepared in Example 9 using the method of A. Vaheri et al., "Infectious Poliovirus RNA: A Sensitive Method of Assay", Virology, 27, pp. 434-36 (1965). Specifically, the cell monolayers are washed with isotonic phosphate-buffered saline (PBS). After 15 minutes, the PBS is completely removed by aspiration. The monolayer is then coated with 0.25 ml of inoculum containing RNA and 500 μg/ml DEAE-dextran (Sigma, St. Louis, Mo.; 500,000 MW) in PBS. The infected monolayers are kept undisturbed at room temperature for 15 minutes, washed once with Earle's lacteal maintenance medium and then overlaid with either fresh medium or medium containing 1% agar, as described in Example 8. After 4-5 days at 34° C., virus is detected by plaque formation (in cultures containing agar) or by cytopathic effect.

Virus is then isolated and purified. Viral RNA is isolated by the techniques described previously. The isolated RNA is sequenced and the nucleotide at position 2493 is confirmed as cytosine, the same as the original source virus.

EXAMPLE 11 Evaluation Of The Effect Of Nucleotide 2493 On The Attenuation Of A Strain 3 Poliovirus

A derivative of an attenuated strain 3 poliovirus cDNA which contains a T instead of a C at nucleotide 2493 is constructed, such as pVR318. Viruses are produced from cDNAs with either Tor C at 2493 as in Examples 8 and 10. The resulting viruses are then tested for neurovirulence in monkeys according to standard protocols.

EXAMPLE 12 Increasing The Attenuation Of A Strain 3 Poliovirus

A strain 3 poliovirus cDNA which contains a T at position 2493 is subjected to site-directed mutagenesis to convert the T to a C, For example, pLED3 is digested with PstI to remove the poliovirus coding sequence. The poliovirus cDNA contains a single, internal PstI site at nucleotide 2604. Therefore, a PstI digestion yields a 5', 2.6 kilobase viral cDNA fragment and a 3', 4.8 kb viral cDNA fragment The 2.6 kb fragment is isolated by standard techniques and subcloned into the unique PstI site of vector M13mp18. Nucleotide 2493 is then converted from a T to a C by site-directed mutagenesis using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacturer's directions. Following mutagenesis, the 2.6 kb piece of cDNA is removed from the vector, relegated to the 3' piece of viral cDNA and the intact cDNA is cloned into pBR322. Both the mutagenized cDNA and the original cDNA are used to produce viable poliovirus as in Example 8. The resulting viruses are assayed for attenuation according to standard protocols. The mutagenized virus produced by the cDNA containing a C at nucleotide 2493 is expected to be more attenuated than the original virus.

Confirmatory sequencing of the entire cDNA sequence in plasmid pLED3 revealed an additional A (adenosine) in a region where a run of six A's is normally found in the viral genome (positions 4133-4138). The following steps describe a method to derive pLED3.2 from pLED3:

Isolation and mutagenesis of erroneous pLED3 sequence

Starting with purified pLED3 plasmid DNA, digest the DNA to completion with restriction enzymes SacI and HindIII. By a method of choice (i.e., gel electrophoresis, HPLC) purify the 2,346 base pair SacI/HindIII fragment representing cDNA nucleotides 1895-4241.

Subclone the purified restriction fragment into bacteriophage M13 to enable oligo-directed deletion mutagenesis. An DNA oligonucleotide spanning nucleotides 4121-4150 is synthesized for this purpose. The sequence of this oligomer is: 5'-CGCCTCAGTAAATTTTTTCAACCAACTATC-3' (SEQ ID NO:9).

Mutagenesis is performed using a "T7-GEN In Vitro Mutagenesis Kit" (United States Biochemical, Cleveland, Ohio) and following the manufacture's directions. The mutagenized insert is confirmed to possess 6 instead of 7 A's at positions 4133-4138 by sequence analysis. As above, the altered SacI/HindIII fragment is gel purified in preparation for ligation into the plasmid body lacking this restriction fragment.

Preparation of pLED3.2 from pLED3 Plasmid

Starting with purified pLED3 plasmid DNA, partially digest the DNA with restriction enzymes SacI and HindIII. Purify the 10,007 base pair SacI/HindIII fragment corresponding the plasmid minus the 2,346 base pair SacI/HindIII fragment discussed above.

Construction of pLED3.2

The purified plasmid body and mutagenized 2,346 base pair fragment are ligated together. Competent E. coli DH5α cells are transformed with the ligation products and tetracycline resistant colonies are selected. Identity of pLED3.2 is based on the confirmation of 6 A's at 4133-4138.

LED3 and VR318 viruses used in monkey NV test

The full-length cDNA in pLED3 (uncorrected) was infectious even though the additional A discussed above predicts a shift in the reading frame for viral proteins 2C, 3A, 3B, 3C and 3D. In vitro transcription of pLED3 cDNA using T7 RNA polymerase produced RNAs which possessed the erroneous seven A's as determined by direct sequence analysis. When these transcripts were used to transfect monkey kidney cells, virus was recovered which possessed the correct number of A's at this site.

Although the cDNAs in pLED3 and pVR318 differ by nucleotide composition at 2493 and by the number of A's between 4130-4136, the 2493 mutation was the only difference found to distinguish the viruses generated using these cDNAs. These cDNA-derived viruses were used to carry out the studies to assess the significance of 2493 mutation (described in the manuscript currently under review).

Polymerases are known to have problems copying from regions in which a single nucleotide is repeated several times. We proposed that T7 RNA polymerase may have generated by error some RNAs which contained six instead of seven A's while transcribing the run of seven A's from pLED3. Only those transcripts possessing six A's could produce virus upon transfection of cells. This proposed explanation seemed highly probable to a scientist who works on T-phage polymerase.

Recombinant DNA prepared by the processes of this invention is exemplified by a sample deposited with the American Type Culture Collection located at 12301 Parklawn Drive, Rockville Md. 20852 under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microorganisms for the Purposes of Patent Procedures. This sample was deposited on Apr. 17, 1990, identified as a plasmid containing the infectious full length copy of vaccine strain Sabin 3, and designated pLED3. This deposit was assigned ATCC accession number 40789.

EXAMPLE 13 Experimental Methods Sabin 3-Specific Mutation Within the N-Terminal Region of VP1 is Attenuating

Preparation of cDNA clones

Viral RNA was extracted from Sabin 3 vaccine virus (Lederle-Praxis Biologicals, N.Y.) after pelleting and SDS disruption as described before (20). A cDNA library was made from the viral RNA template using reverse transcriptase and a procedure described elsewhere (12). Oligonucleotide-directed mutagenesis was performed on DNAs subcloned in M13 grown in E. coli CJ236, to change the cDNA sequence at single base positions (16). For this purpose, antisense oligonucleotides (17-mers) were synthesized with the target nucleotide positioned in the center of the sequence. Common restriction enzyme cleavage sites EcoRI, HindIII, SacI, SmaI) were used to construct full-length cDNA clones. The full-length cDNAs (7,459 bp) were constructed in vector pBR322, selected and amplified using E. coli DH5α (Bethesda Research Laboratories).

Cells

Vero cells were propagated as monolayers in Eagle's MEM with Earle's salts supplemented with 0.11% bicarbonate and 10% fetal bovine serum. Primary African green monkey kidney cells (PCMK) were initiated in BME with Hank's salts containing 0.035% bicarbonate and 10% serum. At 70% confluence, the cells were refed with BME (Earle's salts) containing 5% serum. During poliovirus infection, all cultures were maintained in modified Earle's lactalbumin hydrolyzate maintenance medium (LMM; pH 7.34 without serum.

viruses

The Sabin 3 vaccine virus (RSO+2; Lederle-Praxis Biologicals, Pearl River, N.Y.) was prepared by a single passage of the rederived (RSO+1) manufacturing seed in PCMK cells. For experimental purposes, the same seed was used to produce RSO+2 vaccine virus in Vero cells. Virus NC1 represents vaccine (SO+2) produced in PCMK cells from the original (SO+1) manufacturing seed. The cDNA-derived viruses recovered from Vero cells transfected with T7 RNA polymerase transcripts of pLED3 or pVR318 were designated LED3+1 and VR318+1. These "seed" viruses were amplified once more in Vero cells at a multiplicity of infection (MOI) of 0.1 at 33.5°±0.5° to create virus stocks LED3+2 and VR318+2. Particular attention was paid to preparing the LED3+2 and VR318+2 virus samples in a manner as similar as possible to actual vaccine (ie. passage level and culture conditions). The characterization studies were carried out using these final virus stocks.

Determination of Virus Titer

The titer of infectious virus in samples was determined most often by microtitration on HEp-2 cells and expressed as tissue culture infectious dose (TCID)₅₀ per ml (1). In some cases, infectious titer was measured by plaque titration on Vero cell monolayers and therefore expressed as plaque-forming units (pfu) per ml. Serial ten-fold dilutions of virus prepared in LMM were used to inoculate 25 cm² confluent monolayers and allowed to absorb at 22° C. The cells were overlaid with 1.0% Noble agar (Difco Laboratories) in MEM (Hank's) plus 2% fetal calf serum then incubated at 33.5°±0.5° C. After 3 days, plaques were visualized and counted after staining the cells with neutral red (0.01% solution). For a given sample, there is routinely a 0.6 log difference in absolute numbers determined using the two methods described above; the TCID₅₀ value is always greater.

Nucleotide Sequence Determination

RNA sequence was determined using synthetic oligonucleotides and the dideoxynucleotide chain termination method as described before (20). Sequencing of cloned cDNAs was performed using Sequenase DNA Sequencing Kit (U.S. Biochemical).

In vitro RNA Synthesis and Transfection

DNA templates were prepared by digesting the plasmid DNA completely with PvuI restriction enzyme followed by extraction with phenol and ethanol precipitation, Transcription reaction mixtures containing 1 μg of DNA template were prepared as described by Moss et al. (12). RNA synthesis was initiated by addition of 30 units purified T7 RNA polymerase (Pharmacia) and the reactions were incubated at 37° C. for 90 mins. The DNA template and full-length transcription product were quantitated by comparing-the intensity of appropriately-sized bands to known amounts of a standard after electrophoresis in agarose gels stained with ethidium bromide. The 9.0 kb HindIII fragment of bacteriophage lambda DNA was used as the standard for template; a 7.5 kb single-stranded RNA marker (Bethesda Research Laboratories) was used for transcript. Numerical values were obtained by performing densitometry on a photographic negative taken of the gel. Aliquots of the transcription reaction mixture containing 25 μg of full-length transcript were used to transfect Vero cell monolayers (25 cm2) according to the procedure described by van der Werf et al, (23), The cDNA-derived viruses were harvested when the cell monolayer was completely destroyed (+4 CPE).

Isotopic Labeling of Virus and PAGE

Vero cells (6.5×10⁶) were infected at an MOI of 16 pfu/cell. The virus was allowed to absorb for 1 hour, the cells were then washed twice and covered with LMM before incubation at 33.5° C. Four and a half hours later, the medium was changed to methionine-free MEM (Select Amine Kit, GIBCO). After one hour, the cells were replenished with fresh medium supplemented with [³⁵ S]methionine (specific activity, >1000 Ci/mmol; Amersham) to a final concentration of 60 μCi/ml and incubation continued. At +4 CPE (within 24 hr), the medium containing virus was clarified, first by low-speed centrifugation (2500 rpm, 20 mins, 4° C.) and then by microfiltration (0.22 μm Millex-GV; Millipore). Virus was pelleted by ultracentrifugation using a Beckman 70.1Ti rotor (70K, 1 h, 4° C.), and then resuspended and boiled in Laemmli sample buffer (10). The samples were subjected to SDS-polyacrylamide gel electrophoresis and the proteins visualized by autoradiography.

Virus Thermostabiltiy Curves

Multiple 3.0 ml samples of virus (approx. 10⁷.3 pfu/ml in LMM) were incubated at room temperature (22° C.) or in water baths at 37° C. or 42° C. At 24 hour intervals over the course of 5 days, one sample from each incubation set was removed and frozen at -20° C. Once all the sample incubations were completed, the virus titer in each sample was measured by plaque titration on Vero monolayers.

Virus Growth Curves

Vero cell monolayers (10⁷ cells/25cm² flask) were infected with 4 pfu/cell and maintained as described above. At the indicated times post-infection, the medium from individual cultures was harvested then stored at 70° C.

Neurovirulence Test

Virus samples were tested for neurovirulence in Macca mulatto monkeys using two accepted procedures. As described in the United States Code of Federal Regulations (22), 0.2 ml or 0.5 ml of virus sample containing at least 10⁷.6 TCID₅₀ /ml is injected into the spinal cord (IS) or the thalamic region of each brain hemisphere (IT) of a monkey, for each test respectively. The WHO test (26) involves intraspinal injection of monkeys with 0.1 ml of virus (titer of 10⁶.5 -10⁷.5 TCID₅₀ /ml). After injection, the test animals are observed for 17-21 days for clinical signs of poliomyelitis. The animals are then sacrificed to allow histological examination of the brain and spinal cord for poliovirus lesions. The nervous tissue from each monkey is evaluated using a scoring method from 1 (low) to 4 (high) to reflect the severity of neuronal damage observed. The mean lesion score is an average of the scores recorded for monkeys within a group.

Statistical Analysis

Differences in mean lesion scores were analyzed by Analysis of Variance (ANOVA) model with Least Squares determinations employed for mean range testing. Frequencies of reactivity in the monkeys injected intrathalamically were tested for significance using Chi-square test.

Results

Construction of an authentic LED3 cDNA

Screening of the cDNA library through restriction analysis and nucleotide sequencing, identified four cDNA clones which represented all but the 5' six nucleotides of the Sabin 3 genome. The 5'-most cDNA clone was modified so that first six nucleotides of Sabin 3 cDNA were reconstructed and positioned under the control of bacteriophage T7 promoter by incorporation of a synthetic DNA oligonucleotide [5'-CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCGGGTTTG-3'] using the polymerase chain reaction. The orientation of the promoter ensures the synthesis of positive-sense RNA transcripts of the cloned cDNA. When the complete nucleotide sequence of the four cDNA subclones was compared to the LED3 RNA sequence (24) four nucleotide differences were identified. It is unclear whether the nucleotide differences represent errors made during reverse transcription or sequence divergence of a minor population within the vaccine virus sample. However, the full-length uncorrected cDNA was not infectious since virus nor CPE resulted from transfection of cells with T7-derived transcripts made of the cDNA. The sequence was corrected using oligonucleotide-directed mutagenesis (see Materials and Methods, Example 13) at three of these positions: 198 (G to A), 4466 (T to C) and 6334 (T to C). The last error, U instead of C at position 2493, fortuitously represented the Leon-like nucleotide at this position and was left untouched in the full-length cDNA in pVR318. This clone differs from LED3 RNA sequence only at n2493. The final step in the production of an authentic LED3 cDNA (pLED3.2) was accomplished by cDNA fragment exchange (SacI at n1895 to HindIII at n4241) between pVR318 and a vaccine cDNA subclone found to possess C at position 2493.

The full-length LED3 cDNA construct in cloning vector pBR322 is diagrammed in FIG. 7A. Nucleotide sequence analysis across the junction between the 3' terminus of the Sabin 3 cDNA and plasmid pBR322 indicated a poly-A tail consisting of 27 A's. A string of 15 C's derived from cDNA cloning steps immediately follows the poly-A tail.

In vitro transcription of the cDNAs in pLED3 and pVR318

The plasmid DNAs were digested with PvuI to 1) generate a linear DNA template containing the entire poliovirus cDNA for transcription by T7 RNA polymerase and to 2) create a template with the shortest span of extraneous vector sequence after the 3' end of the cDNA. PvuI corresponds to the first restriction site after the 3' end of the cDNA (exactly 126 bp) which is not present within the cDNA. Two PvuI fragments (4.4 kb and 8.1 kb) are produced from pLED3 and pVR318, but only the 8.1 kb fragment contains the T7 promoter based on sequence analysis. (refer to FIG. 7, A). FIG. 7B demonstrates that RNAs are efficiently synthesized by T7 RNA polymerase from the PvuI-restricted pLED3 and pVR318 and their size matches that predicted for runoff transcripts from the 8.1 kb fragment. In lanes 5 and 6, the two upper bands (low intensity) correspond to the 8.1 kb and 4.4 fragments produced by digestion of the plasmid DNAs with PvuI. Lane 4 contains HindIII/lambda DNA for a ds DNA size reference, The transcription product (most intense band in lanes 5 & 6) comigrates neatly with the 7.5 kb ssRNA marker (lanes 1-3) which is consistent for full-length (7585 nucleotides) RNAs. In comparison to virion RNA (lane 7), the in vitro transcripts of the poliovirus cDNA migrate slightly faster, perhaps due to the absence of covalently attached VPg protein. A transcription reaction containing 1 μl of template reproducibly produced 20-25 μg of full-length transcript. As shown here, the band corresponding to in 1 μg of a 50 μl reaction mixture (lane 5 or 6) is approximately the same intensity as the band representing 500 ng of 7.5 kb ssRNA (lane 2). Direct sequence analysis of these transcripts determined that there are two extra guanines at the 5' end immediately before the first poliovirus nucleotide and confirmed the presence of 126bp of extraneous pBR322 sequence at the 3'end after the poly-A, poly-C tail.

When Vero cells were transfected with the RNAs described above, cytopathic effects consistent with poliovirus infection were observed in 24 hours and virus were harvested (+4 CPE) within 48-72 hours. The specific infectivity of the transcripts was found to be 1-2×10² pfu/μg, about 3% that of vaccine RNA. As determined by sequencing, the RNA from the recovered viruses lacks the extraneous pBR322 sequence at the 3' end. In addition, the genomes of these viruses were verified to possess the attenuated nucleotides at positions 472(U), 2034(U), and 6061(U); nucleotide 2493 was the only known difference between LED3 (2493-C) and VR318 (2493-U).

Mutation at n2493 correlates with altered VP1 mobility

The mutation identified at position 2493 in the consensus genome of Sabin 3 vaccine virus predicts a Ile-6→Thr substitution in VP1 (24). To determine if the biochemical properties of VP1 would be altered by this amino acid substitution, the UPI proteins of LED3 altered by this amino acid substitution, the VP1 proteins of LED3 and VR318 were compared. The VP1 from LED3 contains threonine at residue 6, whereas VR318 has isoleucine at this site. The difference in molecular weights between these amino acids is minimal and yet the VP1 proteins from these viruses are distinguishable by SDS-PAGE (FIG. 8). A possible explanation for this observed difference in VP1 migration derives from the fact that the Thr side chain has a hydroxyl group that can form a hydrogen bond, whereas the Ile side chain is hydrophobic. As a result, the VP1 of VR318 virus with Ile-6 may bind more SDS and therefore migrate faster than the VP1 of LED 3. These data associate at least one biophysical change with the 2493 mutation. Whether this alteration in VP1 impacts the 3D structure of the virion is under evaluation.

Thermostability of LED3 and VR318 viruses

In the three-dimensional structure of poliovirus, the N-terminal region of VP1 is buried on the inside of native virions in close association with terminal regions of the other capsid proteins (6). In attempt to assess whether the 2493 mutation alters virion stability, LED3 and VR318 viruses were compared for susceptibility to thermal inactivation at several temperatures. As illustrated in FIGS. 9A-9C there was no loss in titer observed for either virus sample over a five day period at room temperature (22° C.). At 37° and 42° C., the titers of both virus samples decreased similarly. Despite heat treatment, the difference in plaque morphology between LED3 and VR318 viruses were preserved (see below).

Effect of n2493 on phenotypic markers

Small plaque size is often used to differentiate attenuated vaccine strains from virulent strains (13). In conducting a simple plaque titration of the cDNA-derived virus samples on Vero cells monolayers, it became apparent that the VR318 virus produces plaques that are obviously larger than those of LED3 virus. When the comparison included the pathogenic parent Leon strain, it was clear that VR318 plaques are of intermediate size (see FIG. 10). These data suggest that the single nucleotide difference at position 2493 between LED3 (C) and VR318 (U) is responsible for the increase in plaque size displayed by VR318 compared to LED3.

The traditional temperature-sensitivity (rct/40° C.) and "d" marker phenotypes of LED3 and VR318 viruses were also evaluated. Although these viruses were not distinguishable by either of these tests, both exhibited the attenuated phenotype compared to Leon virus (data not shown).

Growth curves of LED3 and VR318

To compare LED3 and VR318 replication, the growth kinetics of these viruses were compared in Vero cells under conditions known to be permissive for attenuated poliovirus (i.e, low temperature, high pH). FIG. 11 demonstrates there is no dramatic difference between the kinetics of virus release from Vero cells infected with LED3 and VR318. The titers achieved at 24 hours post-infection (10⁸ pfu/ml) indicated that neither of these viruses is severely debilitated under these conditions. A consistent, although subtle, difference was observed in the fold increase in titer between 8 and 12 hours post-infection suggesting that VR318 may have the ability to replicate faster than LED3. During this time period, the titer of VR318 increased 200-fold compared to only a 30-fold increase for LED3. Whether modification of culture conditions (i.e., lower pH) exaggerates the observed difference between LED3 and VR318 growth kinetics is under evaluation.

Neurovirulence of LED3 and VR318 in monkeys

Through the construction and neurovirulence testing of recombinant viruses derived from full-length Sabin 3 and Leon cDNAs, Westrop eta. (25) have correlated the attenuated phenotype of Sabin 3 with the point mutations at positions 472 and 2034. Applicants' identification of the Sabin 3-specific point mutation at 2493 (24), raised the question of whether this mutation might also be a determinant of attenuation.

The cDNA-derived viruses, LED3 and VR318, were compared to appropriate controls for neurovirulence as tested in monkeys by procedures contained in either the WHO or United States CFR requirements for the acceptance of vaccine lots (see Materials and Methods, Example 13). Differences between these procedures include the route of inoculation (intrathalamic & intraspinal versus only intraspinal) as well as amount (volume and titer) of the sample injected.

Table 1A lists the neurovirulence data from the CFR intraspinal (IS) test in which LED3 and VR318 were tested concurrently and compared to test results of actual vaccine (RSO+2) produced on primary monkey kidney cells (PCMK). Testing of RSO+2 (Vero) vaccine demonstrated that the use of Vero cells to produce vaccine virus as described had no effect on attenuation. The mean lesion scores produced by RSO+2 (PCMK), RSO+2 (Vero) and LED3+2 (Vero) were 0.52, 0.36 and 0.34, respectively. These data demonstrate that LED3 is no more neurovirulent than current vaccine virus. Interestingly, the mean lesion score of monkeys receiving VR318 was 1.31 which was significantly higher (p<0.01) than the scores produced by the other viruses. These data indicate that VR318 virus is not equivalent to current RSO+2 vaccine and that presence of C (LEd3 and RSO+2) instead of U (VR318) at nucleotide position 2493 is attenuating.

As above, the neurovirulence of LED3 and VR318 after intrathalamic route of injection into monkeys was compared to data obtained from four complete tests of current RSO+2 vaccine (Table 1B). Since brain tissue is less susceptible than spinal tissue to poliovirus infection, neurovirulence using this procedure is based essentially on whether any lesions are visible and in what percentage of the monkeys rather than a lesion score. As demonstrated by actual RSO+2 vaccine, a low level of reactivity in a group of monkeys (4.0%) is typical and desirable. Of the 10 monkeys receiving LED3, none exhibited lesions. VR318, however, produced lesions in 2 of 10 (20%) test animals. Although a group of 30 monkeys are required for a complete IT test, the increased percentage of positive monkeys in the VR318 group is highly unusual and predicts that VR318 would fail CFR IT data indicate that when compared to current RSO+2 vaccine, LED3 virus is equivalent and VR318 is more neurovirulent.

Using the WHO test procedure, LED3 and VR318 were evaluated concurrently with virus NC1, which is equivalent to the attenuated type 3 WHO test reference. As listed in Table 2, the mean lesion scores for LED3 and VR318 were 0.21 and 1.51, respectively. Confirmed by three different methods, the demonstration that LED3 is more attenuated than VR318 is unequivocal. The interpretation of the WHO test data is made somewhat difficult however by the performance of the attenuated reference, NC1. In this test, NC1 produced a mean lesion score of 1.08, which falls between the values calculated for LED3 and VR318. Although a mean score of 1.08 is high for this test reference, the comparison of reactivity between NC1, LED3 and VR318 is valid since they were tested concurrently. A statistical comparison of the resultant scores points to the fact that by this test procedure, VR318 cannot be distinguished from the attenuated reference. Based on this preliminary data, it is unclear whether VR318 would fail in a full WHO test involving 24 monkeys per group. On the other hand, the lesion score associated with LED3 was shown to be significantly lower than either VR318 or the NC1 reference (p<0.01).

Interestingly, the NC1 reference virus represents vaccine material manufactured using Sabin original (SO), not the rederived Sabin original (RSO) seed. Nucleotide sequence determination at position 2493 of SO+2 vaccine (Lederle; same as NC1) revealed a 1:1 mixture of C and the variant U. Similar evaluation of current RSO+2 vaccine (Lederle) demonstrated only C at this position supporting the fact that the RSO seed is a purified derivative of the SO strain (24). Since test samples LED3, VR318 and NC1 did not differ in nucleotide composition at position 472 based on RNA sequence determination, the increased level of neurovirulence exhibited by NC1 compared to LED3 using the WHO procedure likely derives from the subpopulation of 2493-U variants in the NC1 pool of virus.

The identification of a new Sabin 3-specific mutation at position 2493 which encodes an isoleucine to threonine change at the sixth amino acid of VP1 (24) is shown here to be a determinant of attenuation. The attenuation of Sabin 3 poliovirus has been correlated to point mutations at positions 472 and 2034 by others previously (25). To assess the contribution of the 2493 mutation, a virus was produced using a completely verified vaccine cDNA (LED3) and compared it to a derivative, VR318, which was the same except that it possessed the Leon-like nucleotide (U) instead of C at this position. The additional mutation in LED3 correlated to smaller plaque size as well as decreased neurovirulence in monkeys.

The data presented herein demonstrate that the biological properties associated with attenuation of the Sabin 3 vaccine strain were preserved in LED3. There are many benefits associated with the use of a Sabin 3 cDNA-derived seed strain. The stock volumes of both the original and rederived Sabin seeds, SO and RSO respectively, are limited due to the fact that they are virus plaque isolates. When seeds are stored in the form of a cloned, genetically defined cDNA, the limitations on seed supply are removed; in addition, such seeds can be preserved indefinitely. Without restriction on seed supply there is increased flexibility in the multiplicity of infection (MOI) that can be used to produce vaccine. Weeks-Levy et al. (24) showed that virus samples produced using accepted manufacturing seeds at higher MOI's are genetically more homogeneous. Since RNA viruses by their nature generate variants at height frequency than DNA viruses, an RNA virus seed established in the form of a cloned, genetically defined DNA should be the most homogeneous. By this approach, the amount of undesirable variants that could be selectively amplified during passage have been minimized which translates to increased genetic stability. Passage studies to assess whether LED3 constitutes a more genetically stable see as compared to other manufacturing seeds for Sabin 3 vaccine is currently under evaluation.

Based on nucleotide sequence determination, Sabin 3 variants possessing U at 2493 during passage in vitro were shown to accumulate more rapidly than variants possessing C at 472 (24). In the same study, the Sabin 3 component of different OPVs was found to vary greatly in the proportion of C and U at 2493. Although the data presented here do not address how the proportion of 2493-U variant affects the acceptability of vaccine lots, data comparing viruses LED3 (2493-C) and VR318 (2493-U) suggest that the result will depend on neurovirulence test method used for the evaluation. Of particular interest was the ability of the CFR intrathalamic test method to distinguish LED3 and VR318. The unusually high reactivity of VR318 in the brain compared to LED3 or RSO+2 vaccine suggests that the interaction between virus and brain tissue is enhanced when virus possesses U at 2493. Since the WhO test method does not evaluate vaccine by intrathalamic route of injection, our data suggest that this test would be less likely to detect 2493-U virus subpopulation in vaccine lots.

A detection method incorporating the polymerase chain reaction was used recently by Chumakov et al. (2) to determine that vaccine lots containing 472-C variants comprising greater than 1.17% of total virus failed the WHO neurovirulence test. The determination of equivalence at position 472 for LED3 and VR318 was based on sequence analysis of the viral RNA. It was determined that a 10% variant subpopulation is the limit of detection using this method (20). It is possible that the more sensitive PCR method would detect 472-C subpopulation in both LED3 and VR318 virus preparations. It is however unlikely that these virus samples would differ in proportion of 472-C variant because LED3 and VR318 are equivalent passage levels from the cloned cDNA and were generated under identical conditions. A preliminary evaluation of LED3 and VR318 using a PCR method to detect 472-C variants shows that these samples cannot be distinguished.

There are several lines of data supporting the selective advantage for Sabin 3 variants possessing U at 2493 in vivo, as well as in vitro. Stool isolate KW4, recovered 5 days post-vaccination, was shown previously to differ from the administered vaccine strain at three positions: 472=C, 2493=U, and 6061=U/C (20). Based on the data presented here, the intermediate level of neurovirulence observed for KW4 in that study can now be attributed to the mutation at 2493 as well as 472. Weeks-Levy et al. (24) found that two of three virus isolates recovered from nervous tissue (brain or spinal cord) of monkeys that had been injected intraspinally with NC1, the attenuated reference virus, possessed U at 2493. That isolate NC1-679B had U at 2493 without loss of the attenuated U at 472 may specifically relate to how this virus spread and replicated in the brain. The data are consistent with observations of increased neurovirulence for VR318 compared to LED 3 as tested by CFR intrathalamic method.

The mechanism by which the change from U to C at 2493 attenuates LED3 compared VR318 is unclear. This mutation alters the sixth amino acid capsid protein VP1. In the three-dimensional structure of poliovirus, Hogle et al. (6) show that this region of VP1 is buried on the inside of the native virion. More recently, Fricks and Hogle (5) demonstrated that upon attachment to susceptible cells, the virion undergoes conformational changes resulting in release of capsid protein VP4 and externalization of the amino terminus of VP1. These authors demonstrated further that exposure of the amino terminus of VP1 was required for attachment to liposomes and proposed that these events play a role in the mechanism of cell entry. Consistent with these observations, Kirkegaard (8) described two poliovirus mutants with different small deletions in the amino terminal region of VP1 which flank either side of residue six as defective in the physical release of viral RNA from the capsid during normal infection. Both of these deletion mutants exhibited small plaque phenotype. From these data, it is easy to speculate that the mutation is Sabin 3 at position 2493 also affects viral uncoating.

In addition to the mutation in VP3 (2034) of Sabin 3, attenuation determinants have been mapped to the capsid proteins in Sabin 1 (14) and a type 2 strain (P2/712) which is closely related to Sabin 2 (16). Although these other mutations occur within capsid protein VP1, the structural mutation described in this study is the first one to be mapped to the amino terminal region of VP1.

While a number of embodiments of this invention have been described hereinabove, it is apparent that the basic constructions can be altered to provide other embodiments which utilize the processes, recombinant DNA molecules, cDNA molecules and transformed hosts of this invention. Therefore, it will be appreciated that the scope of this invention is to be defined by the claims appended hereto rather than by specific embodiments which have been presented hereinbefore by way of example.

                                      TABLE 1                                      __________________________________________________________________________     NEUROVIRULENCE OF LED3 AND VR318 STRAINS                                       USING CFR NV TEST PROCEDURE                                                    A) INTRASPINAL.sup.a                                                                      CELL                    MEAN                                                   SUB-  NUCLEOTIDE AT                                                                             NO. OF LESION                                      GROUP                                                                               VIRUS STRATE                                                                               472  2493  MONKEYS                                                                               SCORE                                       __________________________________________________________________________     1    RSO + 2                                                                              PCMK  U    C     24     0.52                                        2    RSO + 2                                                                              VERO  U    C     12     0.36                                        3    LED3 + 2                                                                             VERO  U    C     16     0.34                                        4    VR318 + 2                                                                            VERO  U    U     16     1.31.sup.b                                  __________________________________________________________________________     B) INTRATHALAMIC.sup.c                                                                    CELL                                                                           SUB-  NUCLEOTIDE AT                                                                             NO. OF PERCENT                                     GROUP                                                                               VIRUS STRATE                                                                               472  2493  MONKEYS                                                                               POSITIVE                                    __________________________________________________________________________     5    RSO + 2                                                                              PCMK  U    C     120    4                                           6    LED3 + 2                                                                             VERO  U    C     10     0                                           7    VR318 + 2                                                                            VERO  U    U     10     20.sup.d                                    __________________________________________________________________________      .sup.a 0.2 ml of virus (titer ≧7.6 log TCID.sub.50 /ml)                 administered intraspinally.                                                    .sup.b Group 4 > 1, 2, 3 (p < 0.01) by ANOVA and mean range testing.           .sup.c 0.5 ml of virus (titer ≧7.6 log TCID.sub.50 /ml)                 administered intracerebrally into the thalamic region of each hemisphere.      .sup.d Group 7 > 5, 6 (p < 0.05) by Chisquare test.                      

                                      TABLE 2                                      __________________________________________________________________________     NEUROVIRULENCE OF LED3 AND VR318 STRAINS                                       USING WHO NV TEST PROCEDURE.sup.a                                                         CELL                    MEAN                                                   SUB-  NUCLEOTIDE AT                                                                             NO. OF LESION                                      GROUP                                                                               VIRUS STRATE                                                                               472  2493  MONKEYS                                                                               SCORE                                       __________________________________________________________________________     1    LED3 + 2                                                                             VERO  U    C     6      0.21.sup.b                                  2    NC1.sup.c                                                                            PCMK  U    U/C   6      1.08                                        3    VR318 + 2                                                                            VERO  U    U     6      1.51                                        __________________________________________________________________________      .sup.a 0.1 ml of virus (titer 6.5 to 7.5 log TCID.sub.50 /ml) administere      intraspinally.                                                                 .sup.b Group 2 < 1, 3 (p < 0.1) by ANOVA and mean range testing.               .sup.c NC1 (SO + 2) is an attenuated type 3 reference for the WHO NV test                                                                               

REFERENCES

1. Albrecht, P., J. C. Enterline, E. J. Boone, and M. J. Klutch. 1983. Polioviurs and polio anitbody assay in HEp-2 and Vero cell cultures. J. Biol. Stand. 11:91-97.

2. Chumakov, K. M., L. B. Powers, K. E. Noonan, I. B. Ronison and I. S. Levenbook. 1991. Correlation between amount of virus with altered nucleotide sequence and the monkey test for acceptability of oral poliovirus vaccine. Proc. Natl. Acad. Sci. USA 88:199-203.

3. Domingo, E. 1989. RNA virus evolution and the control of viral disease. Prog. Drug Res. 33;93-133.

4. Dunn, G., N. T. Begg, N. Cammack, and P. D. Minor. 1990. Virus exretion and mutation by infants following primary vaccination with live oral poliovaccine from two sources. J. Med. Virol. 32:92-95.

5. Fricks, C. E. and J. M. Hogle. 1990 Cell-induced conformational change in poliovirus: Externalization of the amino terminus of VP1 is responsible for liposome binding. J. Virol. 64:1934-1945.

6. Hogle, J. M., M. Chow, and D. J. Filman. 1985. Three-dimensional structure of poliovirus at 2.9A resolution. Science 229:1358-1365.

7. Kew, O. M., B. K. Nottay, M. H. Hatch, J. H. Nakano and J. F. Obijeski. 1981. Multiple changes can occur in the oral polio vaccines upon relication in humans. J. Gen. Virol. 56:337-347.

8. Kirkegaard, K. 1990. Mutations in VP1 of poliovirus specifically affect both encapsidation and release of viral RNA. J. Virol. 64:195-206.

9. Kohara, M., A. Shinobu, S. Kuge, B. L. Semler, T. Komatsu, M. Arita, H. Itoh and A. Nomoto. 1985. An infectious cDNA clone of the poliovirus Sabin strain could be used as a stable repository and inoculum for the oral polio live vaccine. Virology 151:21-30.

10. Laemmli, U. K. 1970. Cleavage of structural proteins during the assembly of the head of bacteriophage T4. Nature (London) 227:680-685.

11. Melnick, J. L., M. Benyesh-Melnick, and J. C. Brennan. 1959. Studies on live poliovirus vaccine. JAMA. 171:63-70.

12. Moss, E. G., R. E. O'Neill, and V. R. Racaniello. 1989. Mapping of attenuating sequences of an avirulent poliovirus type 2 strain. J. Virol. 63:1884-1890.

13. Nakano, J. H., M. H. Hatch, M. L. Thieme and B. Nottay. 1978. Parameters for differentiating vaccine-derived and wild poliovirus strains. Prog. Med. Virol. 24:178-206.

14. Nomoto, A. and E. Wimmer. 1987. Genetic studies of the antigenicity and the attenuation phenotype of poliovirus, p. 107-134. In W. C. Russell and J. W. Almond (ed.), Molecular basis of virus disease. Cambridge University Press, Cambridge.

15. Racaniello, V. R. 1988. Poliovirus neurovirulence. Adv. Virus Res. 34:217-246.

16. Ren, R., E. G. Moss, and V. R. Racaniello. 1991. Indentification of two determinants that attenuate vaccine-related type 2 poliovirus. J. Virol. 65:1377-1382.

17. Stanway, G., A. J. Cann, R. Hauptmann, P. Hughes, L. D. Clarke, R. C. Mountford, P. D. Minor, G. C. Schild, and J. W. Almond. 1983. The nucleotide sequence of poliovirus type 3 leon 12 a₁ b: comparison with poliovirus type 1. Nucleic Acids Res. 11:5629-5643.

18. Stanway, G., P. J. Hughes, R. C. Mountford, P. Reeve, P. D. Minor, G. C. Schild, and J. W. Almond. 1984. Comparison of complete nucleotide sequences of the genomes of the neurovirulent poliovirus P3/Leon/37 and its attenuated Sabin vaccine derivative P3/Leon12a₁ b. Proc. Natl. Acad. Sci. USA 81:1539-1543.

19. Stones, P. B., C. R. Macdonald, J. K. McDougall and P. F. Ramsbottom. 1964. Preparation and properties of a derivative of Sabin's type 3 poliovirus strain Leon 12a₁ b. 10th Symposium of the European Association against Poliomyelitis, Warsaw, pp. 390-397.

20. Tatem, J. M., C. Weeks-Levy, S. J. Mento, S. J. MiMichele, A. Georgiu, W. F. Waterfiled, B. Sheip, C. Costalas, T. Davies, M. B. Ritchey and F. R. Cano. J. Med. Virol., in press.

21. Toyoda, H., Kohara, M., Kataoka, Y., Suganuma, T., Omata, T., Imura, N., and Nomoto, A. 1984. Complete nucleotide sequences of all three poliovirus serotype genomes: Implication for gentic relationship, gene function and antigenic determinants. J. Mol. Biol. 174:561:585.

22. United States Code of Federal Regulations. 1990. Poliouvirus vaccine live oral. Title 21, Sec. 630.10-17.

23. van der Werf, S., J. Bradley, E. Wimmer, F. W. Studier and J. J. Dunn. 1986. Sythesis of infectious poliovirus RNA by purified T7 RNA polymerase. Proc. Natl. Acad. Sci. USA 83:2330-2334.

24. Weeks-Levy, C., J. M. Tatem, S. J. DiMichele, W. Waterfield, A. F. Georigu and S. J. Mento. Submitted for publication.

25. Westrop, G. D., K. A. Wareham, D. M. A. Evans, G. Dunn, P. D. Minor, D. I. Magrath, F. Taffs, S. Marsden, M. A. Skinner, G. C. Schild and J. W. Almond. 1989. Genetic basis of attenuation of the Sabin type 3 oral poliovirus vaccine. J. Virol. 63:1338-1344.

26. World Health Organization. 1990. Requirements for poliomyelitis vaccine (oral). W.H.O. Tech. Rep. Ser. 800:30-36.

    __________________________________________________________________________     SEQUENCE LISTING                                                               (1) GENERAL INFORMATION:                                                       (iii) NUMBER OF SEQUENCES: 9                                                   (2) INFORMATION FOR SEQ ID NO:1:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 7432 base pairs                                                    (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                          (iv) ANTI-SENSE: NO                                                           (ix) FEATURE:                                                                  (A) NAME/KEY: CDS                                                              (B) LOCATION: 743..7361                                                        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                        TTAAAACAGCTCTGGGGTTGTTCCCACCCCAGAGGCCCACGTGGCGGCTAGTACACTGGT60                 ATCACGGTACCTTTGTACGCCTGTTTTATACTCCCTCCCC CGCAACTTAGAAGCATACAA120               TTCAAGCTCAATAGGAGGGGGTGCAAGCCAGCGCCTCCGTGGGCAAGCACTACTGTTTCC180                CCGGTGAGGCCGCATAGACTGTTCCCACGGTTGAAAGTGTCCGATCCGTTATCCGCTCAT240                GTACTTCGAGAAGC CTAGTATCGCTCTGGAATCTTCGACGCGTTGCGCTCAGCACTCAAC300               CCCGGAGTGTAGCTTGGGCCGATGAGTCTGGACAGTCCCCACTGGCGACAGTGGTCCAGG360                CTGCGCTGGCGGCCCACCTGTGGCCCAAAGCCACGGGACGCTAGTTGTGAACAGGGTG TG420               AAGAGCCTATTGAGCTACATGAGAGTCCTCCGGCCCCTGAATGCGGCTAATTCTAACCAT480                GGAGCAGGCAGCTGCAACCCAGCAGCCAGCCTGTCGTAACGCGCAAGTCCGTGGCGGAAC540                CGACTACTTTGGGTGTCCGTGTTTCCTTTTAT TCTTGAATGGCTGCTTATGGTGACAATC600               ATAGATTGTTATCATAAAGCGAGTTGGATTGGCCATCCAGTGTGAATCAGATTAATTACT660                CCCTTGTTTGTTGGATCCACTCCCGAAACGTTTTACTCCTTAACTTATTGAAATTGTTTG720                AAGACAG GATTTCAGTGTCACAATGGGAGCTCAAGTATCATCCCAAAAAGTA772                       MetGlyAlaGlnValSerSerGlnLysVal                                                 1510                                                                           GGCGCTCACGAGAATTCTAACCGAGCCTACGGTGGTTCTACGATCAAC820                            GlyAlaHisGluAsnSerAsnArgAlaTyrGlyGlySerThrIleAsn                               1520 25                                                                        TACACCACAATTAATTATTATAAAGATTCCGCAAGTAATGCGGCGTCC868                            TyrThrThrIleAsnTyrTyrLysAspSerAlaSerAsnAlaAlaSer                               3035 40                                                                        AAACAAGATTACTCACAGGATCCATCAAAATTCACCGAGCCACTAAAG916                            LysGlnAspTyrSerGlnAspProSerLysPheThrGluProLeuLys                               455055                                                                         GACGTGCTCATAAAAACAGCTCCAGCACTCAATTCACCAAATGTGGAA964                            AspValLeuIleLysThrAlaProAlaLeuAsnSerProAsnValGlu                               606570                                                                         GCGT GTGGGTATAGTGATAGAGTGTTGCAACTCACTTTAGGCAATTCC1012                          AlaCysGlyTyrSerAspArgValLeuGlnLeuThrLeuGlyAsnSer                               75808590                                                                       ACTATTACTACACAGGAGGCAGCAAATTCAGTAGTGGCTTACGGACGT1060                           ThrIleThrThrGlnGluAlaAlaAsnSerValValAlaTyrGlyArg                               95100 105                                                                      TGGCCTGAGTTTATTAGAGATGACGAAGCAAACCCGGTGGACCAACCA1108                           TrpProGluPheIleArgAspAspGluAlaAsnProValAspGlnPro                               110115 120                                                                     ACTGAACCAGATGTGGCTACATGCAGATTCTACACACTAGACACTGTA1156                           ThrGluProAspValAlaThrCysArgPheTyrThrLeuAspThrVal                               125130135                                                                      ATGTGGGGTAAGGAGTCGAAAGGCTGGTGGTGGAAGTTACCTGACGCA1204                           MetTrpGlyLysGluSerLysGlyTrpTrpTrpLysLeuProAspAla                               140145150                                                                      CTGA GAGACATGGGTCTGTTTGGACAAAACATGTATTACCACTACCTA1252                          LeuArgAspMetGlyLeuPheGlyGlnAsnMetTyrTyrHisTyrLeu                               155160165170                                                                   GGAAGATCCGGGTACACTGTGCACGTGCAGTGTAATGCATCCAAATTT1300                           GlyArgSerGlyTyrThrValHisValGlnCysAsnAlaSerLysPhe                               175180 185                                                                     CACCAAGGTGCACTCGGGGTGTTTGCGATTCCTGAGTATTGTCTGGCG1348                           HisGlnGlyAlaLeuGlyValPheAlaIleProGluTyrCysLeuAla                               190195 200                                                                     GGTGACAGTGACAAGCAAAGGTACACTAGTTATGCAAATGCGAATCCA1396                           GlyAspSerAspLysGlnArgTyrThrSerTyrAlaAsnAlaAsnPro                               205210215                                                                      GGTGAAAGAGGGGGAAAATTTTACTCCCAATTCAACAAGGATAACGCA1444                           GlyGluArgGlyGlyLysPheTyrSerGlnPheAsnLysAspAsnAla                               220225230                                                                      GTAA CATCCCCAAAAAGAGAGTTCTGCCCAGTGGATTATCTCCTGGGA1492                          ValThrSerProLysArgGluPheCysProValAspTyrLeuLeuGly                               235240245250                                                                   TGTGGGGTGTTACTGGGAAATGCCTTTGTATACCCACATCAAATCATT1540                           CysGlyValLeuLeuGlyAsnAlaPheValTyrProHisGlnIleIle                               255260 265                                                                     AATCTGAGGACCAACAACAGCGCAACTATTGTCCTACCATATGTGAAT1588                           AsnLeuArgThrAsnAsnSerAlaThrIleValLeuProTyrValAsn                               270275 280                                                                     GCTTTGGCCATTGATTCAATGGTTAAACACAACAACTGGGGCATTGCC1636                           AlaLeuAlaIleAspSerMetValLysHisAsnAsnTrpGlyIleAla                               285290295                                                                      ATTCTGCCCTTATCACCGCTGGATTTTGCTCAAGATTCATCAGTTGAA1684                           IleLeuProLeuSerProLeuAspPheAlaGlnAspSerSerValGlu                               300305310                                                                      ATTC CAATTACTGTGACAATTGCCCCAATGTGTAGCGAGTTCAACGGC1732                          IleProIleThrValThrIleAlaProMetCysSerGluPheAsnGly                               315320325330                                                                   CTTCGCAACGTGACTGCACCTAAATTTCAAGGACTACCAGTGTTGAAC1780                           LeuArgAsnValThrAlaProLysPheGlnGlyLeuProValLeuAsn                               335340 345                                                                     ACTCCTGGTAGTAACCAGTACCTGACGTCAGACAACCACCAATCACCA1828                           ThrProGlySerAsnGlnTyrLeuThrSerAspAsnHisGlnSerPro                               350355 360                                                                     TGCGCAATCCCAGAATTTGATGTCACTCCGCCTATTGATATCCCAGGT1876                           CysAlaIleProGluPheAspValThrProProIleAspIleProGly                               365370375                                                                      GAGGTTAAAAACATGATGGAGCTCGCCGAGATAGACACCATGATTCCT1924                           GluValLysAsnMetMetGluLeuAlaGluIleAspThrMetIlePro                               380385390                                                                      CTCA ATTTGGAGAGCACCAAGAGAAACACAATGGACATGTACAGAGTT1972                          LeuAsnLeuGluSerThrLysArgAsnThrMetAspMetTyrArgVal                               395400405410                                                                   ACTCTGAGCGACAGTGCCGATCTATCGCAACCAATTTTGTGCTTGTCA2020                           ThrLeuSerAspSerAlaAspLeuSerGlnProIleLeuCysLeuSer                               415420 425                                                                     CTATCCCCAGCATTTGATCCGCGCTTGTCACACACCATGCTTGGGGAA2068                           LeuSerProAlaPheAspProArgLeuSerHisThrMetLeuGlyGlu                               430435 440                                                                     GTACTGAACTATTATACTCATTGGGCCGGGTCCTTGAAATTTACCTTC2116                           ValLeuAsnTyrTyrThrHisTrpAlaGlySerLeuLysPheThrPhe                               445450455                                                                      CTGTTCTGTGGTTCAATGATGGCTACGGGGAAAATCCTAGTGGCCTAT2164                           LeuPheCysGlySerMetMetAlaThrGlyLysIleLeuValAlaTyr                               460465470                                                                      GCAC CACCAGGTGCACAACCCCCCACCAGCCGTAAGGAGGCTATGTTG2212                          AlaProProGlyAlaGlnProProThrSerArgLysGluAlaMetLeu                               475480485490                                                                   GGCACACATGTCATTTGGGATCTTGGCCTGCAATCATCTTGTACTATG2260                           GlyThrHisValIleTrpAspLeuGlyLeuGlnSerSerCysThrMet                               495500 505                                                                     GTGGTGCCGTGGATTAGTAATGTGACATACAGACAGACTACACAAGAT2308                           ValValProTrpIleSerAsnValThrTyrArgGlnThrThrGlnAsp                               510515 520                                                                     AGTTTCACTGAGGGCGGATATATCAGCATGTTCTACCAAACAAGAATT2356                           SerPheThrGluGlyGlyTyrIleSerMetPheTyrGlnThrArgIle                               525530535                                                                      GTGGTGCCACTGTCCACCCCTAAGAGTATGAGCATGCTGGGGTTTGTG2404                           ValValProLeuSerThrProLysSerMetSerMetLeuGlyPheVal                               540545550                                                                      TCAG CCTGTAATGATTTCAGTGTGCGATTGCTGCGAGACACCACTCAC2452                          SerAlaCysAsnAspPheSerValArgLeuLeuArgAspThrThrHis                               555560565570                                                                   ATTTCACAATCTGCGCTTCCACAGGGTATTGAAGATTTGACTTCTGAA2500                           IleSerGlnSerAlaLeuProGlnGlyIleGluAspLeuThrSerGlu                               575580 585                                                                     GTTGCACAGGGCGCCCTAACTTTGTCACTCCCGAAGCAACAGGATAGC2548                           ValAlaGlnGlyAlaLeuThrLeuSerLeuProLysGlnGlnAspSer                               590595 600                                                                     TTACCTGATACTAAGGCCAGTGGCCCGGCGCATTCCAAGGAGGTACCT2596                           LeuProAspThrLysAlaSerGlyProAlaHisSerLysGluValPro                               605610615                                                                      GCACTCACTGCAGTCGAGACTGGAGCCACCAATCCTCTGGCACCATCC2644                           AlaLeuThrAlaValGluThrGlyAlaThrAsnProLeuAlaProSer                               620625630                                                                      GACA CAGTTCAAACGCGCCACGTAGTCCAACGACGCAGCAGGTCAGAG2692                          AspThrValGlnThrArgHisValValGlnArgArgSerArgSerGlu                               635640645650                                                                   TCCACAATAGAATCATTCTTCGCACGCGGGGCGTGCGTCGCTATTATT2740                           SerThrIleGluSerPhePheAlaArgGlyAlaCysValAlaIleIle                               655660 665                                                                     GAGGTGGACAATGAACAACCAACCACCCGGGCACAGAAACTATTTGCC2788                           GluValAspAsnGluGlnProThrThrArgAlaGlnLysLeuPheAla                               670675 680                                                                     ATGTGGCGCATTACATACAAAGATACAGTGCAGTTGCGCCGTAAGTTG2836                           MetTrpArgIleThrTyrLysAspThrValGlnLeuArgArgLysLeu                               685690695                                                                      GAGTTTTTCACATACTCTCGTTTTGACATGGAATTCACCTTCGTGGTA2884                           GluPhePheThrTyrSerArgPheAspMetGluPheThrPheValVal                               700705710                                                                      ACCG CCAACTTCACCAACGCTAATAATGGGCATGCACTCAACCAGGTG2932                          ThrAlaAsnPheThrAsnAlaAsnAsnGlyHisAlaLeuAsnGlnVal                               715720725730                                                                   TACCAGATAATGTACATCCCCCCAGGGGCACCCACACCAAAGTCATGG2980                           TyrGlnIleMetTyrIleProProGlyAlaProThrProLysSerTrp                               735740 745                                                                     GACGACTACACTTGGCAAACATCTTCCAACCCGTCCATATTTTACACC3028                           AspAspTyrThrTrpGlnThrSerSerAsnProSerIlePheTyrThr                               750755 760                                                                     TATGGGGCTGCCCCGGCGCGAATCTCAGTGCCATACGTGGGGTTAGCC3076                           TyrGlyAlaAlaProAlaArgIleSerValProTyrValGlyLeuAla                               765770775                                                                      AATGCTTACTCGCACTTTTACGACGGCTTCGCCAAGGTGCCATTGAAG3124                           AsnAlaTyrSerHisPheTyrAspGlyPheAlaLysValProLeuLys                               780785790                                                                      ACAG ATGCCAATGACCAGATTGGTGATTCCTTGTACAGCGCCATGACA3172                          ThrAspAlaAsnAspGlnIleGlyAspSerLeuTyrSerAlaMetThr                               795800805810                                                                   GTTGATGACTTTGGTGTATTGGCAGTTCGTGTTGTCAATGATCACAAC3220                           ValAspAspPheGlyValLeuAlaValArgValValAsnAspHisAsn                               815820 825                                                                     CCCACTAAAGTAACCTCCAAAGTCCGCATTTACATGAAACCCAAACAC3268                           ProThrLysValThrSerLysValArgIleTyrMetLysProLysHis                               830835 840                                                                     GTACGTGTCTGGTGCCCTAGACCGCCGCGCGCGGTACCTTATTATGGA3316                           ValArgValTrpCysProArgProProArgAlaValProTyrTyrGly                               845850855                                                                      CCAGGGGTGGACTATAGGAACAACTTGGACCCCTTATCTGAGAAAGGT3364                           ProGlyValAspTyrArgAsnAsnLeuAspProLeuSerGluLysGly                               860865870                                                                      TTGA CCACATATGGCTTTGGGCATCAGAATAAAGCTGTGTACACTGCT3412                          LeuThrThrTyrGlyPheGlyHisGlnAsnLysAlaValTyrThrAla                               875880885890                                                                   GGTTACAAGATCTGCAACTACCATCTCGCCACTAAGGAGGATTTACAA3460                           GlyTyrLysIleCysAsnTyrHisLeuAlaThrLysGluAspLeuGln                               895900 905                                                                     AATGCTGTAAGCATCATGTGGAATAGAGACCTCTTGGTTGTTGAATCA3508                           AsnAlaValSerIleMetTrpAsnArgAspLeuLeuValValGluSer                               910915 920                                                                     AAAGCTCAAGGTACCGACTCAATAGCAAGGTGCAATTGCAATGCAGGG3556                           LysAlaGlnGlyThrAspSerIleAlaArgCysAsnCysAsnAlaGly                               925930935                                                                      GTGTACTATTGTGAGTCCAGAAGGAAATACTACCCTGTGTCGTTTGTG3604                           ValTyrTyrCysGluSerArgArgLysTyrTyrProValSerPheVal                               940945950                                                                      GGAC CCACCTTCCAATACATGGAGGCTAATGACTACTACCCAGCTAGA3652                          GlyProThrPheGlnTyrMetGluAlaAsnAspTyrTyrProAlaArg                               955960965970                                                                   TACCAATCCCACATGTTAATCGGGCACGGCTTTGCCTCACCAGGTGAC3700                           TyrGlnSerHisMetLeuIleGlyHisGlyPheAlaSerProGlyAsp                               975980 985                                                                     TGTGGTGGTATCCTTAGGTGTCAACATGGCGTCATCGGAATCGTGACA3748                           CysGlyGlyIleLeuArgCysGlnHisGlyValIleGlyIleValThr                               990995 1000                                                                    GCTGGTGGAGAGGGATTAGTCGCATTCTCTGACATAAGGGACTTGTAT3796                           AlaGlyGlyGluGlyLeuValAlaPheSerAspIleArgAspLeuTyr                               10051010101 5                                                                  GCTTACGAGGAAGAGGCCATGGAGCAGGGCATTTCAAACTATATTGAG3844                           AlaTyrGluGluGluAlaMetGluGlnGlyIleSerAsnTyrIleGlu                               102010251030                                                                   TC ACTCGGTGCTGCGTTCGGTAGTGGGTTCACTCAGCAAATAGGGGAT3892                          SerLeuGlyAlaAlaPheGlySerGlyPheThrGlnGlnIleGlyAsp                               1035104010451 050                                                              AAGATATCAGAACTAACCAGCATGGTGACCAGCACGATTACAGAGAAG3940                           LysIleSerGluLeuThrSerMetValThrSerThrIleThrGluLys                               10551060 1065                                                                  CTACTTAAAAACCTAATCAAAATTATTTCATCTCTGGTGATTATCACT3988                           LeuLeuLysAsnLeuIleLysIleIleSerSerLeuValIleIleThr                               10701075 1080                                                                  AGAAATTACGAAGATACCACCACAGTGCTCGCCACTCTAGCTCTTCTT4036                           ArgAsnTyrGluAspThrThrThrValLeuAlaThrLeuAlaLeuLeu                               10851090 1095                                                                  GGGTGTGATGTTTCACCGTGGCAATGGCTGAAGAAGAAAGCATGTGAC4084                           GlyCysAspValSerProTrpGlnTrpLeuLysLysLysAlaCysAsp                               110011051110                                                                   ACTTTGGAGATTCCCTATGTTATTAGACAGGGTGATAGTTGGTTGAAA4132                           ThrLeuGluIleProTyrValIleArgGlnGlyAspSerTrpLeuLys                               111511201125 1130                                                              AAATTTACTGAGGCGTGCAACGCAGCTAAGGGGTTGGAATGGGTGTCC4180                           LysPheThrGluAlaCysAsnAlaAlaLysGlyLeuGluTrpValSer                               11351140 1145                                                                  AACAAAATCTCAAAATTTATTGACTGGTTGAGAGAAAGAATCATCCCA4228                           AsnLysIleSerLysPheIleAspTrpLeuArgGluArgIleIlePro                               11501155 1160                                                                  CAAGCCAGGGACAAGCTTGAGTTTGTAACCAAATTGAAACAGTTGGAA4276                           GlnAlaArgAspLysLeuGluPheValThrLysLeuLysGlnLeuGlu                               11651170 1175                                                                  ATGCTAGAGAATCAGATATCCACAATACACCAATCTTGTCCAAGTCAG4324                           MetLeuGluAsnGlnIleSerThrIleHisGlnSerCysProSerGln                               11801185 1190                                                                  GAACACCAGGAAATTTTGTTCAACAATGTACGCTGGTTGTCCATTCAA4372                           GluHisGlnGluIleLeuPheAsnAsnValArgTrpLeuSerIleGln                               119512001205 1210                                                              TCCAAGAGATTCGCTCCATTGTACGCACTTGAGGCCAAGAGAATACAA4420                           SerLysArgPheAlaProLeuTyrAlaLeuGluAlaLysArgIleGln                               12151220 1225                                                                  AAGTTGGAACACACCATTAATAATTACATACAGTTCAAGAGCAAACAC4468                           LysLeuGluHisThrIleAsnAsnTyrIleGlnPheLysSerLysHis                               12301235 1240                                                                  CGTATTGAGCCAGTATGTTTGTTAGTGCATGGGAGCCCAGGTACAGGA4516                           ArgIleGluProValCysLeuLeuValHisGlySerProGlyThrGly                               12451250 1255                                                                  AAATCAGTTGCGACTAACCTAATTGCTAGAGCCATAGCTGAGAAAGAG4564                           LysSerValAlaThrAsnLeuIleAlaArgAlaIleAlaGluLysGlu                               12601265 1270                                                                  AACACCTCCACCTACTCGCTACCACCGGACCCGTCTCACTTTGATGGA4612                           AsnThrSerThrTyrSerLeuProProAspProSerHisPheAspGly                               127512801285 1290                                                              TACAAACAACAAGGTGTGGTTATCATGGACGACCTAAACCAAAACCCG4660                           TyrLysGlnGlnGlyValValIleMetAspAspLeuAsnGlnAsnPro                               12951 3001305                                                                  GATGGGGCAGATATGAAGCTCTTTTGTCAAATGGTGTCCACTGTGGAG4708                           AspGlyAlaAspMetLysLeuPheCysGlnMetValSerThrValGlu                               1310 13151320                                                                  TTTATCCCACCTATGGCCTCGCTGGAAGAGAAAGGCATTCTGTTCACA4756                           PheIleProProMetAlaSerLeuGluGluLysGlyIleLeuPheThr                               1325133 01335                                                                  TCCAACTATGTTTTAGCCTCCACCAACTCCAGTCGCATCACACCACCT4804                           SerAsnTyrValLeuAlaSerThrAsnSerSerArgIleThrProPro                               13401345 1350                                                                  ACAGTAGCCCACAGTGACGCTCTGGCCAGGAGGTTCGCTTTCGATATG4852                           ThrValAlaHisSerAspAlaLeuAlaArgArgPheAlaPheAspMet                               135513601 3651370                                                              GATATTCAAGTGATGGGCGAGTACTCCAGAGATGGTAAACTCAACATG4900                           AspIleGlnValMetGlyGluTyrSerArgAspGlyLysLeuAsnMet                               1375 13801385                                                                  GCAATGGCTACTGAGACGTGCAAGGACTGCCACCAACCAGCAAACTTC4948                           AlaMetAlaThrGluThrCysLysAspCysHisGlnProAlaAsnPhe                               1390 13951400                                                                  AAAAGATGCTGTCCTTTAGTGTGTGGTAAGGCAATTCAGTTAATGGAC4996                           LysArgCysCysProLeuValCysGlyLysAlaIleGlnLeuMetAsp                               1405 14101415                                                                  AAATCTTCCAGAGTTAGGTACAGTGTTGACCAGATTACTACAATGATT5044                           LysSerSerArgValArgTyrSerValAspGlnIleThrThrMetIle                               14201425 1430                                                                  ATCAACGAGAGAAACAGAAGATCTAACATTGGCAATTGCATGGAGGCT5092                           IleAsnGluArgAsnArgArgSerAsnIleGlyAsnCysMetGluAla                               14351440 14451450                                                              TTGTTCCAAGGACCACTCCAGTACAAAGACCTGAAAATTGACATCAAG5140                           LeuPheGlnGlyProLeuGlnTyrLysAspLeuLysIleAspIleLys                               1455 14601465                                                                  ACGAGGCCCCCCCCTGAATGCATCAATGATCTGCTTCAAGCAGTTGAC5188                           ThrArgProProProGluCysIleAsnAspLeuLeuGlnAlaValAsp                               1470 14751480                                                                  TCCCAGGAAGTGAGGGATTATTGTGAAAAGAAAGGATGGATCGTCAAC5236                           SerGlnGluValArgAspTyrCysGluLysLysGlyTrpIleValAsn                               1485 14901495                                                                  ATCACTAGCCAAGTTCAAACAGAGAGAAACATTAACCGAGCAATGACC5284                           IleThrSerGlnValGlnThrGluArgAsnIleAsnArgAlaMetThr                               1500 15051510                                                                  ATTTTGCAGGCAGTGACAACTTTCGCCGCAGTGGCTGGTGTCGTGTAC5332                           IleLeuGlnAlaValThrThrPheAlaAlaValAlaGlyValValTyr                               15151520 15251530                                                              GTCATGTACAAGTTATTCGCTGGACACCAGGGAGCATACACTGGTCTG5380                           ValMetTyrLysLeuPheAlaGlyHisGlnGlyAlaTyrThrGlyLeu                               1535 15401545                                                                  CCAAACAAAAGACCCAATGTGCCCACCATTAGAGCAGCAAAAGTGCAA5428                           ProAsnLysArgProAsnValProThrIleArgAlaAlaLysValGln                               1550 15551560                                                                  GGGCCTGGGTTTGACTATGCAGTGGCTATGGCTAAAAGAAACATTGTT5476                           GlyProGlyPheAspTyrAlaValAlaMetAlaLysArgAsnIleVal                               1565 15701575                                                                  ACAGCAACTACTAGCAAAGGGGAGTTCACAATGCTAGGAGTCCACGAC5524                           ThrAlaThrThrSerLysGlyGluPheThrMetLeuGlyValHisAsp                               1580 15851590                                                                  AACGTGGCCATTTTACCAACTCATGCCTCACCTGGTGAGAGTATTGTA5572                           AsnValAlaIleLeuProThrHisAlaSerProGlyGluSerIleVal                               15951600 16051610                                                              ATTGATGGCAAAGAGGTTGAAATCCTAGACGCTAAAGCCCTCGAAGAT5620                           IleAspGlyLysGluValGluIleLeuAspAlaLysAlaLeuGluAsp                               1 61516201625                                                                  CAGGCAGGCACTAATCTGGAAATCACCATAATAACCCTCAAAAGAAAT5668                           GlnAlaGlyThrAsnLeuGluIleThrIleIleThrLeuLysArgAsn                                163016351640                                                                  GAAAAGTTCAGAGATATCAGACAACACATACCCACTCAAATCACCGAG5716                           GluLysPheArgAspIleArgGlnHisIleProThrGlnIleThrGlu                               164 516501655                                                                  ACGAATGATGGAGTTCTGATTGTGAACACTAGTAAGTACCCCAACATG5764                           ThrAsnAspGlyValLeuIleValAsnThrSerLysTyrProAsnMet                               1660 16651670                                                                  TATGTTCCTGTCGGTGCTGTGACTGAGCAGGGATACCTAAATCTCGGT5812                           TyrValProValGlyAlaValThrGluGlnGlyTyrLeuAsnLeuGly                               16751 68016851690                                                              GGGCGCCAGACTGCTCGTATTCTAATGTACAACTTTCCAACCAGAGCT5860                           GlyArgGlnThrAlaArgIleLeuMetTyrAsnPheProThrArgAla                                169517001705                                                                  GGTCAGTGTGGTGGAGTCATCACATGCACTGGGAAAGTCATCGGGATG5908                           GlyGlnCysGlyGlyValIleThrCysThrGlyLysValIleGlyMet                                171017151720                                                                  CACGTTGGTGGGAATGGTTCACATGGGTTTGCAGCGGCCCTGAAGCGG5956                           HisValGlyGlyAsnGlySerHisGlyPheAlaAlaAlaLeuLysArg                                172517301735                                                                  TCATACTTCACTCAGAGCCAAGGTGAAATCCAGTGGATGAGACCATCA6004                           SerTyrPheThrGlnSerGlnGlyGluIleGlnTrpMetArgProSer                               1740 17451750                                                                  AAGGAGGCAGGGTATCCAATTATAAACGCCCCAACCAAGACCAAGCTC6052                           LysGluAlaGlyTyrProIleIleAsnAlaProThrLysThrLysLeu                               1755 176017651770                                                              GAGCCCAGTGCTTTCCACTATGTGTTTGAAGGAGTAAAGGAACCAGCA6100                           GluProSerAlaPheHisTyrValPheGluGlyValLysGluProAla                                177517801785                                                                  GTCCTCACAAAGAATGATCCCAGACTCAAAACAGACTTTGAAGAAGCA6148                           ValLeuThrLysAsnAspProArgLeuLysThrAspPheGluGluAla                                179017951800                                                                  ATCTTCTCTAAGTATGTAGGGAACAAGATCACTGAGGTGGATGAGTAC6196                           IlePheSerLysTyrValGlyAsnLysIleThrGluValAspGluTyr                                180518101815                                                                  ATGAAAGAGGCAGTGGACCATTATGCTGGACAACTTATGTCGCTGGAT6244                           MetLysGluAlaValAspHisTyrAlaGlyGlnLeuMetSerLeuAsp                                182018251830                                                                  ATCAGCACAGAGCAAATGTGTCTAGAAGACGCCATGTATGGTACTGAT6292                           IleSerThrGluGlnMetCysLeuGluAspAlaMetTyrGlyThrAsp                               1835 184018451850                                                              GGTCTGGAGGCGCTAGATCTGTCTACCAGTGCCGGGTACCCCTACGTG6340                           GlyLeuGluAlaLeuAspLeuSerThrSerAlaGlyTyrProTyrVal                                185518601865                                                                  GCAATGGGGAAGAAGAAGAGAGATATCCTAAACAAGCAAACCAGAGAC6388                           AlaMetGlyLysLysLysArgAspIleLeuAsnLysGlnThrArgAsp                               187018751880                                                                   ACCAAAGAAATGCAAAGACTTTTGGACGCTTACGGAATCAACCTACCA6436                           ThrLysGluMetGlnArgLeuLeuAspAlaTyrGlyIleAsnLeuPr o                              188518901895                                                                   TTAGTGACATATGTCAAGGACGAGCTGAGGTCCAAAACAAAAGTGGAA6484                           LeuValThrTyrValLysAspGluLeuArgSerLysThrLysValGlu                                190019051910                                                                  CAGGGAAAATCCAGACTGATTGAAGCTTCCAGTCTAAATGACTCAGTG6532                           GlnGlyLysSerArgLeuIleGluAlaSerSerLeuAsnAspSerVal                               1915 192019251930                                                              GCCATGAGAATGGCATTTGGAAACCTTTATGCAGCATTCCACAGGAAT6580                           AlaMetArgMetAlaPheGlyAsnLeuTyrAlaAlaPheHisArgAsn                               193519401945                                                                   CCAGGGGTCGTCACTGGTAGTGCAGTTGGATGCGATCCAGACCTATTC6628                           ProGlyValValThrGlySerAlaValGlyCysAspProAspLe uPhe                              195019551960                                                                   TGGAGCAAGATCCCAGTGTTGATGGAAGAAAAGCTATTTGCCTTTGAT6676                           TrpSerLysIleProValLeuMetGluGluLysLeuPheAlaP heAsp                              196519701975                                                                   TACACAGGATACGACGCATCACTTAGCCCAGCTTGGTTTGAGGCACTC6724                           TyrThrGlyTyrAspAlaSerLeuSerProAlaTrpPheGluAla Leu                              198019851990                                                                   AAGATGGTGTTAGAGAAAATTGGTTTTGGAGATAGAGTGGATTACATA6772                           LysMetValLeuGluLysIleGlyPheGlyAspArgValAspTyrIle                               1 995200020052010                                                              GACTACCTTAACCATTCACACCACTTGTACAAAAACAAGATATATTGT6820                           AspTyrLeuAsnHisSerHisHisLeuTyrLysAsnLysIleTy rCys                              201520202025                                                                   GTTAAGGGCGGCATGCCATCTGGCTGCTCCGGCACTTCAATTTTTAAT6868                           ValLysGlyGlyMetProSerGlyCysSerGlyThrSerI lePheAsn                              203020352040                                                                   TCAATGATTAACAATTTGATCATTAGGACGCTTTTACTGAAAACCTAC6916                           SerMetIleAsnAsnLeuIleIleArgThrLeuLeuLeu LysThrTyr                              204520502055                                                                   AAGGGCATAGATTTGGACCACTTAAAAATGATTGCCTATGGTGACGAT6964                           LysGlyIleAspLeuAspHisLeuLysMetIleAlaTyrGly AspAsp                              206020652070                                                                   GTAATAGCTTCCTATCCCCATGAGGTTGACGCTAGTCTCCTAGCCCAA7012                           ValIleAlaSerTyrProHisGluValAspAlaSerLeuLeuAlaGl n                              2075208020852090                                                               TCAGGAAAAGACTATGGACTAACCATGACTCCGGCAGATAAATCTGCC7060                           SerGlyLysAspTyrGlyLeuThrMetThrProAlaAspL ysSerAla                              209521002105                                                                   ACTTTTGAGACAGTCACATGGGAGAATGTAACTTTCTTGAAAAGATTC7108                           ThrPheGluThrValThrTrpGluAsnValThrPhe LeuLysArgPhe                              211021152120                                                                   TTCAGAGCAGATGAGAAATACCCCTTCCTCATACATCCAGTAATGCCA7156                           PheArgAlaAspGluLysTyrProPheLeuIleHis ProValMetPro                              212521302135                                                                   ATGAAGGAAATTCATGAATCAATCAGATGGACAAAAGATCCTCGGAAT7204                           MetLysGluIleHisGluSerIleArgTrpThrLysAs pProArgAsn                              214021452150                                                                   ACGCAGGACCATGTACGCTCCTTGTGTCTATTGGCTTGGCACAACGGG7252                           ThrGlnAspHisValArgSerLeuCysLeuLeuAlaTrpHisA snGly                              2155216021652170                                                               GAAGAAGAATACAACAAATTTTTAGCTAAAATTAGGAGTGTGCCAATC7300                           GluGluGluTyrAsnLysPheLeuAlaLysIleArg SerValProIle                              217521802185                                                                   GGAAGAGCTTTGTTGCTCCCAGAGTACTCAACATTGTACCGCCGTTGG7348                           GlyArgAlaLeuLeuLeuProGluTyrSerThr LeuTyrArgArgTrp                              219021952200                                                                   CTTGACTCATTTTAGTAACCCTACCTCAGTCGAATTGGATTGGGTCATACTGT7401                      LeuAspSerPhe                                                                   2205                                                                           TGTAGGGG TAAATTTTTCTTTAATTCGGAGG7432                                           (2) INFORMATION FOR SEQ ID NO:2:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 2206 amino acids                                                   (B) TYPE: amino acid                                                           (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: protein                                                    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                        MetGlyAlaGln ValSerSerGlnLysValGlyAlaHisGluAsnSer                              151015                                                                         AsnArgAlaTyrGlyGlySerThrIleAsnTyrThrThrIleAsnTyr                               20 2530                                                                        TyrLysAspSerAlaSerAsnAlaAlaSerLysGlnAspTyrSerGln                               354045                                                                         AspProSerLysPheThrGluProLeuLysA spValLeuIleLysThr                              505560                                                                         AlaProAlaLeuAsnSerProAsnValGluAlaCysGlyTyrSerAsp                               657075 80                                                                      ArgValLeuGlnLeuThrLeuGlyAsnSerThrIleThrThrGlnGlu                               859095                                                                         AlaAlaAsnSerValValAlaTyrGlyArgTrpProGluPheIleAr g                              100105110                                                                      AspAspGluAlaAsnProValAspGlnProThrGluProAspValAla                               115120125                                                                      ThrCysArgPhe TyrThrLeuAspThrValMetTrpGlyLysGluSer                              130135140                                                                      LysGlyTrpTrpTrpLysLeuProAspAlaLeuArgAspMetGlyLeu                               145150 155160                                                                  PheGlyGlnAsnMetTyrTyrHisTyrLeuGlyArgSerGlyTyrThr                               165170175                                                                      ValHisValGlnCysAsnAlaSerLysP heHisGlnGlyAlaLeuGly                              180185190                                                                      ValPheAlaIleProGluTyrCysLeuAlaGlyAspSerAspLysGln                               195200 205                                                                     ArgTyrThrSerTyrAlaAsnAlaAsnProGlyGluArgGlyGlyLys                               210215220                                                                      PheTyrSerGlnPheAsnLysAspAsnAlaValThrSerProLysArg                               225 230235240                                                                  GluPheCysProValAspTyrLeuLeuGlyCysGlyValLeuLeuGly                               245250255                                                                      AsnAlaPhe ValTyrProHisGlnIleIleAsnLeuArgThrAsnAsn                              260265270                                                                      SerAlaThrIleValLeuProTyrValAsnAlaLeuAlaIleAspSer                               275 280285                                                                     MetValLysHisAsnAsnTrpGlyIleAlaIleLeuProLeuSerPro                               290295300                                                                      LeuAspPheAlaGlnAspSerSerValGluIleProI leThrValThr                              305310315320                                                                   IleAlaProMetCysSerGluPheAsnGlyLeuArgAsnValThrAla                               325330 335                                                                     ProLysPheGlnGlyLeuProValLeuAsnThrProGlySerAsnGln                               340345350                                                                      TyrLeuThrSerAspAsnHisGlnSerProCysAlaIleProGluPh e                              355360365                                                                      AspValThrProProIleAspIleProGlyGluValLysAsnMetMet                               370375380                                                                      GluLeuAlaGluIleAsp ThrMetIleProLeuAsnLeuGluSerThr                              385390395400                                                                   LysArgAsnThrMetAspMetTyrArgValThrLeuSerAspSerAla                               405 410415                                                                     AspLeuSerGlnProIleLeuCysLeuSerLeuSerProAlaPheAsp                               420425430                                                                      ProArgLeuSerHisThrMetLeuGlyG luValLeuAsnTyrTyrThr                              435440445                                                                      HisTrpAlaGlySerLeuLysPheThrPheLeuPheCysGlySerMet                               450455460                                                                       MetAlaThrGlyLysIleLeuValAlaTyrAlaProProGlyAlaGln                              465470475480                                                                   ProProThrSerArgLysGluAlaMetLeuGlyThrHisValIleTrp                                485490495                                                                     AspLeuGlyLeuGlnSerSerCysThrMetValValProTrpIleSer                               500505510                                                                      AsnValThr TyrArgGlnThrThrGlnAspSerPheThrGluGlyGly                              515520525                                                                      TyrIleSerMetPheTyrGlnThrArgIleValValProLeuSerThr                               5305 35540                                                                     ProLysSerMetSerMetLeuGlyPheValSerAlaCysAsnAspPhe                               545550555560                                                                   SerValArgLeuLeuArgAspThrThrHisI leSerGlnSerAlaLeu                              565570575                                                                      ProGlnGlyIleGluAspLeuThrSerGluValAlaGlnGlyAlaLeu                               580585 590                                                                     ThrLeuSerLeuProLysGlnGlnAspSerLeuProAspThrLysAla                               595600605                                                                      SerGlyProAlaHisSerLysGluValProAlaLeuThrAlaValGlu                                610615620                                                                     ThrGlyAlaThrAsnProLeuAlaProSerAspThrValGlnThrArg                               625630635640                                                                   HisValValGln ArgArgSerArgSerGluSerThrIleGluSerPhe                              645650655                                                                      PheAlaArgGlyAlaCysValAlaIleIleGluValAspAsnGluGln                               660 665670                                                                     ProThrThrArgAlaGlnLysLeuPheAlaMetTrpArgIleThrTyr                               675680685                                                                      LysAspThrValGlnLeuArgArgLysLeuG luPhePheThrTyrSer                              690695700                                                                      ArgPheAspMetGluPheThrPheValValThrAlaAsnPheThrAsn                               705710715 720                                                                  AlaAsnAsnGlyHisAlaLeuAsnGlnValTyrGlnIleMetTyrIle                               725730735                                                                      ProProGlyAlaProThrProLysSerTrpAspAspTyrThrTrpGl n                              740745750                                                                      ThrSerSerAsnProSerIlePheTyrThrTyrGlyAlaAlaProAla                               755760765                                                                      ArgIleSerVal ProTyrValGlyLeuAlaAsnAlaTyrSerHisPhe                              770775780                                                                      TyrAspGlyPheAlaLysValProLeuLysThrAspAlaAsnAspGln                               785790 795800                                                                  IleGlyAspSerLeuTyrSerAlaMetThrValAspAspPheGlyVal                               805810815                                                                      LeuAlaValArgValValAsnAspHisA snProThrLysValThrSer                              820825830                                                                      LysValArgIleTyrMetLysProLysHisValArgValTrpCysPro                               835840 845                                                                     ArgProProArgAlaValProTyrTyrGlyProGlyValAspTyrArg                               850855860                                                                      AsnAsnLeuAspProLeuSerGluLysGlyLeuThrThrTyrGlyPhe                               865 870875880                                                                  GlyHisGlnAsnLysAlaValTyrThrAlaGlyTyrLysIleCysAsn                               885890895                                                                      TyrHisLeu AlaThrLysGluAspLeuGlnAsnAlaValSerIleMet                              900905910                                                                      TrpAsnArgAspLeuLeuValValGluSerLysAlaGlnGlyThrAsp                               915 920925                                                                     SerIleAlaArgCysAsnCysAsnAlaGlyValTyrTyrCysGluSer                               930935940                                                                      ArgArgLysTyrTyrProValSerPheValGlyProT hrPheGlnTyr                              945950955960                                                                   MetGluAlaAsnAspTyrTyrProAlaArgTyrGlnSerHisMetLeu                               965970 975                                                                     IleGlyHisGlyPheAlaSerProGlyAspCysGlyGlyIleLeuArg                               980985990                                                                      CysGlnHisGlyValIleGlyIleValThrAlaGlyGlyGluGlyLe u                              99510001005                                                                    ValAlaPheSerAspIleArgAspLeuTyrAlaTyrGluGluGluAla                               101010151020                                                                   MetGluGlnGlyIleSe rAsnTyrIleGluSerLeuGlyAlaAlaPhe                              1025103010351040                                                               GlySerGlyPheThrGlnGlnIleGlyAspLysIleSerGluLeuThr                               1045 10501055                                                                  SerMetValThrSerThrIleThrGluLysLeuLeuLysAsnLeuIle                               106010651070                                                                   LysIleIleSerSerLeuValIle IleThrArgAsnTyrGluAspThr                              107510801085                                                                   ThrThrValLeuAlaThrLeuAlaLeuLeuGlyCysAspValSerPro                               10901095 1100                                                                  TrpGlnTrpLeuLysLysLysAlaCysAspThrLeuGluIleProTyr                               1105111011151120                                                               ValIleArgGlnGlyAspSerTrpLeuLysLysPheThrGluAl aCys                              112511301135                                                                   AsnAlaAlaLysGlyLeuGluTrpValSerAsnLysIleSerLysPhe                               114011451150                                                                   Il eAspTrpLeuArgGluArgIleIleProGlnAlaArgAspLysLeu                              115511601165                                                                   GluPheValThrLysLeuLysGlnLeuGluMetLeuGluAsnGlnIle                               1170 11751180                                                                  SerThrIleHisGlnSerCysProSerGlnGluHisGlnGluIleLeu                               1185119011951200                                                               PheAsnAsnValArgTrpLeu SerIleGlnSerLysArgPheAlaPro                              120512101215                                                                   LeuTyrAlaLeuGluAlaLysArgIleGlnLysLeuGluHisThrIle                               1220 12251230                                                                  AsnAsnTyrIleGlnPheLysSerLysHisArgIleGluProValCys                               123512401245                                                                   LeuLeuValHisGlySerProGlyThrGlyLysSerVa lAlaThrAsn                              125012551260                                                                   LeuIleAlaArgAlaIleAlaGluLysGluAsnThrSerThrTyrSer                               1265127012751280                                                               LeuProProAspProSerHisPheAspGlyTyrLysGlnGlnGlyVal                               128512901295                                                                   ValIleMetAspAspLeuAsnGlnAsnProAspGlyAlaAspMetLys                                130013051310                                                                  LeuPheCysGlnMetValSerThrValGluPheIleProProMetAla                               131513201325                                                                   SerLeuGluGluLys GlyIleLeuPheThrSerAsnTyrValLeuAla                              133013351340                                                                   SerThrAsnSerSerArgIleThrProProThrValAlaHisSerAsp                               13451350 13551360                                                              AlaLeuAlaArgArgPheAlaPheAspMetAspIleGlnValMetGly                               136513701375                                                                   GluTyrSerArgAspGlyLysLeuAsnMe tAlaMetAlaThrGluThr                              138013851390                                                                   CysLysAspCysHisGlnProAlaAsnPheLysArgCysCysProLeu                               13951400 1405                                                                  ValCysGlyLysAlaIleGlnLeuMetAspLysSerSerArgValArg                               141014151420                                                                   TyrSerValAspGlnIleThrThrMetIleIleAsnGluArgAsnArg                               1425 143014351440                                                              ArgSerAsnIleGlyAsnCysMetGluAlaLeuPheGlnGlyProLeu                               144514501455                                                                   GlnTyr LysAspLeuLysIleAspIleLysThrArgProProProGlu                              146014651470                                                                   CysIleAsnAspLeuLeuGlnAlaValAspSerGlnGluValArgAsp                               1475 14801485                                                                  TyrCysGluLysLysGlyTrpIleValAsnIleThrSerGlnValGln                               149014951500                                                                   ThrGluArgAsnIleAsnArgAlaMetThrIl eLeuGlnAlaValThr                              1505151015151520                                                               ThrPheAlaAlaValAlaGlyValValTyrValMetTyrLysLeuPhe                               15251530 1535                                                                  AlaGlyHisGlnGlyAlaTyrThrGlyLeuProAsnLysArgProAsn                               154015451550                                                                   ValProThrIleArgAlaAlaLysValGlnGlyProGly PheAspTyr                              155515601565                                                                   AlaValAlaMetAlaLysArgAsnIleValThrAlaThrThrSerLys                               157015751580                                                                   GlyGluPhe ThrMetLeuGlyValHisAspAsnValAlaIleLeuPro                              1585159015951600                                                               ThrHisAlaSerProGlyGluSerIleValIleAspGlyLysGluVal                                160516101615                                                                  GluIleLeuAspAlaLysAlaLeuGluAspGlnAlaGlyThrAsnLeu                               162016251630                                                                   GluIleThrIleIleTh rLeuLysArgAsnGluLysPheArgAspIle                              163516401645                                                                   ArgGlnHisIleProThrGlnIleThrGluThrAsnAspGlyValLeu                               16501655 1660                                                                  IleValAsnThrSerLysTyrProAsnMetTyrValProValGlyAla                               1665167016751680                                                               ValThrGluGlnGlyTyrLeuAsnLeuGlyGlyArg GlnThrAlaArg                              168516901695                                                                   IleLeuMetTyrAsnPheProThrArgAlaGlyGlnCysGlyGlyVal                               17001705 1710                                                                  IleThrCysThrGlyLysValIleGlyMetHisValGlyGlyAsnGly                               171517201725                                                                   SerHisGlyPheAlaAlaAlaLeuLysArgSerTyrPheThrGlnSer                                173017351740                                                                  GlnGlyGluIleGlnTrpMetArgProSerLysGluAlaGlyTyrPro                               1745175017551760                                                               IleIleAsnAlaPr oThrLysThrLysLeuGluProSerAlaPheHis                              176517701775                                                                   TyrValPheGluGlyValLysGluProAlaValLeuThrLysAsnAsp                               1780 17851790                                                                  ProArgLeuLysThrAspPheGluGluAlaIlePheSerLysTyrVal                               179518001805                                                                   GlyAsnLysIleThrGluValAspGluTyr MetLysGluAlaValAsp                              181018151820                                                                   HisTyrAlaGlyGlnLeuMetSerLeuAspIleSerThrGluGlnMet                               182518301835 1840                                                              CysLeuGluAspAlaMetTyrGlyThrAspGlyLeuGluAlaLeuAsp                               184518501855                                                                   LeuSerThrSerAlaGlyTyrProTyrValAlaMetGlyLysLy sLys                              186018651870                                                                   ArgAspIleLeuAsnLysGlnThrArgAspThrLysGluMetGlnArg                               187518801885                                                                   LeuLeuAs pAlaTyrGlyIleAsnLeuProLeuValThrTyrValLys                              189018951900                                                                   AspGluLeuArgSerLysThrLysValGluGlnGlyLysSerArgLeu                               19051910 19151920                                                              IleGluAlaSerSerLeuAsnAspSerValAlaMetArgMetAlaPhe                               192519301935                                                                   GlyAsnLeuTyrAlaAlaPhe HisArgAsnProGlyValValThrGly                              194019451950                                                                   SerAlaValGlyCysAspProAspLeuPheTrpSerLysIleProVal                               19551960 1965                                                                  LeuMetGluGluLysLeuPheAlaPheAspTyrThrGlyTyrAspAla                               197019751980                                                                   SerLeuSerProAlaTrpPheGluAlaLeuLysMetValLeuGluLy s                              1985199019952000                                                               IleGlyPheGlyAspArgValAspTyrIleAspTyrLeuAsnHisSer                               200520102015                                                                   HisHisLeuTyrLysAsnLysIleTyrCysValLysGlyGlyMetPro                               202020252030                                                                   SerGlyCysSerGlyThrSerIlePheAsnSerMetIleAsnAsnLeu                                203520402045                                                                  IleIleArgThrLeuLeuLeuLysThrTyrLysGlyIleAspLeuAsp                               205020552060                                                                   HisLeuLysMetIleAlaTyrGly AspAspValIleAlaSerTyrPro                              2065207020752080                                                               HisGluValAspAlaSerLeuLeuAlaGlnSerGlyLysAspTyrGly                               2085 20902095                                                                  LeuThrMetThrProAlaAspLysSerAlaThrPheGluThrValThr                               210021052110                                                                   TrpGluAsnValThrPheLeuLysArgPhePh eArgAlaAspGluLys                              211521202125                                                                   TyrProPheLeuIleHisProValMetProMetLysGluIleHisGlu                               213021352140                                                                   Se rIleArgTrpThrLysAspProArgAsnThrGlnAspHisValArg                              2145215021552160                                                               SerLeuCysLeuLeuAlaTrpHisAsnGlyGluGluGluTyrAsnLys                                216521702175                                                                  PheLeuAlaLysIleArgSerValProIleGlyArgAlaLeuLeuLeu                               218021852190                                                                   ProGluTyr SerThrLeuTyrArgArgTrpLeuAspSerPhe                                    219522002205                                                                   (2) INFORMATION FOR SEQ ID NO:3:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 30 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                        CGCCTCAGTAAATTTTTTCAACCAACTATC30                                               (2) INFORMATION FOR SEQ ID NO:4:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                        GGCGTATCTGACAAGGG17                                                            (2) INFORMATION FOR SEQ ID NO:5:                                               (i ) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 45 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                        CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTG 45                               (2) INFORMATION FOR SEQ ID NO:6:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 19 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                        GAATCATGGTGTCTATC TC19                                                         (2) INFORMATION FOR SEQ ID NO:7:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                        CTCGTTTGTGGCATAAC17                                                            (2) INFORMATION FOR SEQ ID NO:8:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 17 base pairs                                                      (B) TYPE: nucleic acid                                                         (C) STRANDEDNESS: single                                                       (D) TOPOLOGY: linear                                                            (ii) MOLECULE TYPE: DNA (genomic)                                             (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                        CCCATGGGGATGCACCG17                                                            (2) INFORMATION FOR SEQ ID NO:9:                                               (i) SEQUENCE CHARACTERISTICS:                                                  (A) LENGTH: 46 base pairs                                                      (B) TYPE: nucleic acid                                                          (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                           (ii) MOLECULE TYPE: DNA (genomic)                                              (iii) HYPOTHETICAL: NO                                                         (iv) ANTI-SENSE: NO                                                            (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                        CTGCAGTAATACGACTCACTATAGGTTAAAACAGCTCTGGGGTTTG46                               __________________________________________________________________________ 

What is claimed is:
 1. An RNA virus cDNA having the sequence shown in FIG. 6 (SEQ ID NO: 1).
 2. An RNA virus cDNA which codes for a polypeptide having the sequence shown in FIG. 6 (SEQ ID NO: 2). 